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These are lecture notes for a mini-course given at the St. Petersburg School in 
Probability and Statistical Physics in June 2012. Topics include integrable models of 
random growth, determinantal point processes, Schur processes and Markov dynam- 
ics on them, Macdonald processes and their application to asymptotics of directed 
Q polymers in random media. 
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, , Preface 

These lectures are about probabilistic systems that can be analyzed by essentially alge- 
<-] braic methods. 

The historically first example of such a system goes back to De Moivre (1738) and 
Laplace (1812) who considered the problem of finding the asymptotic distribution of a sum 
of i.i.d. random variables for Bernoulli trials, when the pre-limit distribution is explicit, 
t-h and took the limit of the resulting expression. While this computation may look like a 

simple exercise when viewed from the heights of modern probability, in its time it likely 
l/-) served the role of a key stepping stone — first rigorous proofs of central limit theorems 

appeared only in the beginning of the XXth century. 

At the moment we are arguably in a "De Moivre-Laplace stage" for a certain class 
of stochastic systems which is often referred to as the KPZ universality class, after an 
influential work of Kardar-Parisi-Zhang in mid-80's. We will be mostly interested in the 
case of one space dimension. The class includes models of random growth that have built- 
in mechanisms of smoothing and lateral growth, as well as directed polymers in space-time 
^ uncorrelated random media and driven diffusive lattice gases. 

While the class and some of its members have been identified by physicists, the first 
examples of convincing (actually, rigorous) analysis were provided by mathematicians, 
who were also able to identify the distributions that play the role of the Gaussian law. 
Nowadays, they are often referred to as the Tracy-Widom type distributions as they had 
previously appeared in Tracy- Widom's work on spectra of large random matrices. 

The reason for mathematicians' success was that there is an unusually extensive 
amount of algebra and combinatorics required to gain access to suitable pre-limit for- 
mulas that admit large time limit transitions. As we will argue below, the "solvable" or 
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integrable members of the class should be viewed as projections of much more powerful 
objects whose origins lie in representation theory. In a way, this is similar to integrable 
systems that can also be viewed as projections of representation theoretic objects; and this 
is one reason we use the words integrable probability to describe the phenomenon. There 
are also much more direct links between integrable systems and integrable probability 
some of which we mention below. 

The goal of these notes is not to give a survey of a variety of integrable probabilistic 
models that are known by now (there are quite a few, and not all of them are members 
of the KPZ universality class), but rather to give a taste of them by outlining some of the 
algebraic mechanisms that lead to their solution. 

The notes are organized as follows. 

In Section [T] we give a brief and non-exhaustive overview of the integrable members of 
the KPZ universality class in (1+1) dimensions. 

In Section [2] we provide the basics of the theory of symmetric functions that may be 
seen as a language of the classical representation theory. 

In Section [3] we discuss determinantal random point processes — a fairly recent class 
of point processes that proved very effective for the analysis of growth models and also 
for the analysis of integrable probabilistic models of random matrix type. 

Section [1] explains a link between a particular random growth model (known as the 
polynuclear growth process or PNG) and the so-called Plancherel measures on partitions 
that originates from representation theory of symmetric groups. 

In Section [5] we introduce a general class of the Schur measures that includes the 
Plancherel measures; members of these class can be viewed as determinantal point pro- 
cesses, which provides a key to their analysis. We also perform such an analysis in the case 
of the Plancherel measure, thus providing a proof of the celebrated Baik-Deift- Johansson 
theorem on asymptotics of longest increasing subsequences of random permutations. 

Section [6] explains how integrable models of stochastic growth can be constructed with 
representation theoretic tools, using the theory of symmetric functions developed earlier. 

In Section [7] we show how one can use celebrated Macdonald symmetric functions to 
access the problem of asymptotic behavior of certain directed random polymers in (1+1) 
dimensions (known as the O'Connell-Yor polymers). The key feature here is that the 
formalism of determinantal processes does not apply, and one needs to use other tools 
that here boil down to employing Macdonald- Ruijsenaars difference operators. 



1 Introduction 

Suppose that you are building a tower out of unit blocks. Blocks are falling from the 
sky, as shown at Figure [T] (left picture) and the tower slowly grows. If you introduce 
randomness here by declaring the times between arrivals of blocks to be independent 
identically distributed (i.i.d.) random variables, then you get the simplest Id random 
growth model. The kind of question we would like to answer here is what the height h(t) 
of tower at time t is? 

The classical central limit theorem (see e.g. |Billingsley-95 Chapter 5] or 



Kallenberg-02 Chapter 4]) provides the answer: 

h{t) « c^H + ^c^H 
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where c\ and C2 are the mean and standard deviation of the times between arrivals of the 
blocks, respectively, and £ is a standard normal random variable N(0, 1). 



□ t 



□ T 



□ t 



Figure 1: Growth models based on the falling boxes: Id model, 2d model without inter- 
actions, 2d model with sticky boxes (ballistic deposition) 



If blocks fall independently in different columns, then we get a 2d growth model, as 
shown at Figure [T] (middle picture). When there are no interactions between blocks and 
the blocks are aligned, the columns grow independently and fluctuations remain of order 
t 1 / 2 . But what happens if we make blocks sticky so that they get glued to the boxes of 
adjacent columns, as shown at Figure [T] (right picture)? This model is known as ballistic 
deposition and, in general, the answer for it is unknown. However, computer simulations 
(see e.g. |B arabasi-Stanley-95| ) show that the height fluctuations in this model are of 
order t 1 / 3 , and the same happens when the interaction is introduced in various other 
ways. Perhaps, there is also some form of a Central Limit Theorem for this model, but 
nobody knows how to prove it. 

Coming back to the Id case, one attempt to guess the central limit theorem would be 
through choosing certain very special random variables. If the times between arrivals are 
geometrically distributed random variables, then h(t) becomes the sum of independent 
Bernoulli random variables and the application of the Stirling's formula proves the conver- 
gence of rescaled h(t) to the standard Gaussian. (This is the famous De Moivre-Laplace 
theorem.) 

In the 2d case it is also possible to introduce particular models for which we can prove 
something. Consider the interface which is a broken line with slopes ±1, as shown at 
Figure [2] (left picture) and suppose that a new unit box is added at each local minimum 
independently after an exponential waiting time. 

There is also an equivalent formulation of this growth model. Project the interface to 
a straight line and put "particles" at projections of unit segments of slope —1 and "holes" 
at projections of segments of slope +1, see Figure [2] (right picture). Now each particle 
independently jumps to the right after an exponential waiting time (put it otherwise, 
each particle jumps with probability dt in each very small time interval [t, t + dt}) except 
for the exclusion constraint: Jumps to the already occupied spots are prohibited. This 
is a simplified model of a one-lane highway which is known under the name of Totally 
Asymmetric Simple Exclusion Process (TASEP), cf. |Spitzer-70 , |Liggett-85 , |Liggett-99] . 
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Figure 2: Broken line with slopes ±1, local minimum where a box can be added, and 
correspondence with particle configurations on Z. 
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Figure 3: Wedge and flat initial conditions: broken lines and corresponding particle 
configurations. 



Theorem 1.1 ( [ Jo hansson- OOj ) . Suppose that at time the interface h(x;t) is a wedge 
(h(x, 0) = \x\) as shown at Figure^ (left picture). Then for every x E (— 1, 1) 

]imw( h{t ' tX )-^ X)t >-s 

t->°o y C 2 {x)t 1 l A 

where Ci(x), C2(x) are certain (explicit) functions of x. 

Theorem 1.2 ( |Sasamoto-05] . |Borodin-Ferrari-Prahofer-Sasamoto-07| ). Suppose that at 
time the interface h(x;t) is flat as shown at Figure [3] (right picture). Then for every 

where c^, C4 are certain (explicit) positive constants. 

Here F\(s) and ^(s) are distributions from random matrix theory, known un- 
der the name of Tracy- Widom distributions. They are the limiting distributions for 
the largest eigenvalues in Gaussian Orthogonal Ensemble and Gaussian Unitary En- 
semble of random matrices (which are the probability measures with density propor- 
tional to exp(— Trace(X 2 )) on real symmetric and Hermitian matrices, respectively), see 
|Tracy- Widom-9 4 1 , | Tr acy- Widom-96| . 

These two theorems give the conjectural answer for the whole "universality class" of 
2d random growth models, which is usually referred to as the KPZ (Kardar-Parisi-Zhang) 
universality class. Comparing to the answer in the Id case we see that the asymptotic 
behavior becomes more delicate — while scaling by t 1 / 3 is always the same, the resulting 
distribution may also depend on the "subclass" of our model. Also, conjecturally, the only 
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two generic subclasses are the ones we have seen. They are distinguished by whether the 
global surface profile is locally curved or flat near the observation location. 



Let us concentrate on the wedge initial condition. In this case there is yet another 
reformulation of the model. Write in each box of the positive quadrant a random 

"waiting time" w^jy Once our random interface (of type pictured in Figure [2]) reaches 
the box it takes time Wuj\ for it to absorb the box. Now the whole quadrant is filled 
with nonnegative i.i.d. random variables. How to reconstruct the growth of the interface 
from these numbers? More precisely, at what time T(i,j) a given box (i, j) is absorbed 
by the growing interface? A simple argument shows that 



T(i,j) 



(1,1)=6[1]- 



i+j-1 

max w b[k] , 

>6[2]->...6[i+i-l]=(i,i) ^ 



'1.1^ 



where the sum is taken over all directed (leading away from the origin) paths joining 



1, 1) and see Figure El The quantity (1.1) is known as the (directed) Last Passage 



Percolation time. Indeed, if you think about numbers tf(ij) as of times needed to percolate 
into a given box, then (1.1) gives the time to percolate from (1, 1) in (i, j) in the worst 
case scenario. 




Figure 4: The quadrant filled with waiting times and two (out of (^) 
directed paths joining (1, 1) and (4, 1). 



4 possibilities) 



Universality considerations make one believe that the limit behavior of the Last Pas- 
sage Percolation time should not depend on the distribution of wuj) (if this distribution is 
not too wild), but we are very far from proving this at the moment. However, again, the 
Last Passage Percolation time asymptotics has been computed for certain distributions, 



e.g. for the exponential distribution in the context of Theorem |1.1 

Let us present another example, where the (conjecturally, universal) result can be 
rigorously proven. Consider the homogeneous, density 1 Poisson point process in the 
first quadrant, and let L(6) be the maximal number of points one can collect along a 
North-East path from (0,0) to (0,9), as shown at Figure [5] 

This quantity can be seen as a limit of the LPP times when ii>(ij) takes only two values 
and 1, and the probability of 1 is very small. Such considerations explain that L(6) 
should be also in the KPZ universality class. And, indeed, this is true. 
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Figure 5: The Poisson point process in the first quadrant and a North-East path joining 
(0, 0) and (9, 9) and collecting maximal number of points, which is 5 here. 



Theorem 1.3 ( |Baik-Deift- Johansson-99| ) . 



, m fL{9)-29 , 
lim P y >.„ <s)= F 2 {s). 



It is not hard to show that Theorem 1.3 is equivalent to 



Theorem 1.4 ( [Ba ik-Deift- Johansson-9 9j ) . Let a be a uniformly distributed permutation 

of the set {1, . . . , n}, and let E n (a) be the length of the longest increasing subsequence of 

o. Then _ 

nm p ( L - Vn \ 

n^oo \ n 1 ' 6 J 

The problem of understanding the limit behavior of l n has a long history and goes 
back to the book of Ulam of 1961 |Ulam-6l] . Ulam conjectured that E£ n « c^/n but 
was not able to identify the constant; he also conjectured Gaussian fluctuations. In 1974 
Hammersley |Hammersley-72| proved, via a sub-additivity argument, that there exists 
a constant such that £ n c^/n and this constant was identified in 1977 by Kerov and 
Vershik [Vershi k-Kerov-77] . 

The random variable l n has an interesting interpretation in terms of an airplane board- 
ing problem. Imagine a simplified airplane with one seat in each of n rows, large distances 
between rows, and one entrance in front. Each entering passenger has a ticket with a seat 
number, but the order of passengers in the initial queue is random (this is our random 
permutation). Suppose that each passenger has a carry-on, and it takes one minute for 
that person to load it into the overhead bin as soon as (s)he reaches her/his seat. The 
aisle is narrow, and nobody can pass the passenger who is loading the carry-on. It turns 
out that the total time to board the airplane is precisely £ n . Let us demonstrate this with 
an example. 

Consider the permutation a = 2413 with £4(0") = 2. The airplane boarding looks as 
follows: The first passenger enters the airplane and proceeds to the seat number 2. While 
(s)he loads a carry-on, the other passengers stay behind and the one with the ticket for 
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the seat number 1 ((s)he was the third person in the original queue) starts loading her/his 
carry-on. After one minute, the passenger with the ticket for the seat number 4 proceeds 
to his seat and also starts loading, as well as the one aiming for the seat number 3. In 
two minutes the boarding is complete. 

Interestingly enough, if the queue is divided into groups, as often happens in reality, 
then the boarding time (for long queues) will only increase by the factor yk, where k is 
the number of the groups. 

Let us now proceed to more recent developments. In the Last Passage Percolation 
problem we were maximizing a functional H(x) over a set X. A general statistical me- 
chanics principle says that such a maximization can be seen as zero-temperature limit of 
the Gibbs ensemble on X with Hamiltonian —H(x). More formally, we have the following 
essentially obvious statement 

mnxH(x) = lim 4 In Ve" fl(l) . 

xdX /3->oo B ' 

The parameter 3 is usually referred to as the inverse temperature in the statistical me- 
chanics literature. 

In the Last Passage Percolation model, X is the set of all directed paths joining (1, 1) 
with a point (a, b), and the value of H on path x is the sum of u>(i,j) along the path x. The 
Gibbs ensemble in this case is known under the name of a "Directed Polymer in Random 
Media". The study of such objects with various path sets and various choices of noise (i.e. 
W(j.j)) is a very rich subject. 

Directed Polymers in Random Media appeared for the first time close to thirty years 
ago in an investigation of low temperature expansion of the partition function of the Ising 
model with domain wall boundary conditions, see | Huse- Henley-85| , |Imbrie-Spenser-88 , 
but nowadays there are many other physical applications. Let us give one concrete model 
where such polymers arise. 

Consider a set of massive particles in Z that evolve in discrete time as follows. At each 
time moment the mass of each particle is multiplied by a random variable d t , x , where t 
is the time moment and x is the particle's position. Random variables d t)X are typically 
assumed to be i.i.d. Then each particle gives birth to a twin of the same mass and the 
twin moves to x + 1. If we now start at time with a single particle of mass 1 at x = 1, 
then the mass Z(T, x) of all particles at x at time T can be computed as a sum over all 
directed paths (1, 1) = 6[1] ->• b[2\ ->• . . . b[x + T - 1] = (T, x) joining (1, 1) and (T, x): 

x+T-l 

z(t,x)= II d m, (1-2) 

(l,l)=&[l]->6[2]-K..b[x+T-l]=(T,a;) k=l 

This model can be used as a simplified description for the migration of plankton with dt }X 
representing the state of the ocean at location x and time t which affects the speed of 
growth of the population. Independent d t)X model quickly changing media, e.g. due to the 
turbulent flows in the ocean. 

Random Polymers in Random Media exhibit a very interesting phenomenon called 
intermittency which is the existence of large peeks happening with small probability, 
that are high enough to dominate the asymptotics of the moments. Physicists believe 
that intermittency is widespread in nature and, for instance, the mass distribution in the 
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universe or a magnetogram of the sun show intermittent behavior. To see this phenomenon 
in our model, suppose for a moment that d ttX does not depend on t. Then there would 
be locations where the amount of plankton exponentially grows, while in other places all 
the plankton quickly dies, so we see very high peaks. Now it is reasonable to expect that 
such peaks would still be present when dt, x are independent both of t and x and this will 
cause intermittency. Proving and quantifying intermittency is, however, rather difficult. 

Regarding the distribution of Z(T,x), it was long believed in the physics literature 
that it should belong to the same KPZ universality class as the Last Passage Percolation. 
Now, at least in certain cases, we can prove it. The following integrable random polymer 
was introduced and studied by Seppalainen |Seppalainen-12| who proved the £3 exponent 



for the fluctuations. The next theorem is a refinement of this result. 

Theorem 1.5 ( [Bo ro din- Corwin- Remenik- 12] ). Assume dt >x are independent positive ran- 
dom variables with density 



-x " ^exp . 
Y{v) \ x / 

Then there exist 9* > and (explicit) ci, c% > such that for < 9 < 9* , 

™ ( Z(n, n) — c\n \ . . 

lim P 1 ' \ /3 <s)= F 2 (s). 

The upper bound on the parameter 9 > in this theorem is technical and it will 
probably be removed in future works. 

In a similar way to our transition from Last Passage Percolation to monotone paths in 
a Poisson field and longest increasing subsequences, we can do a limit transition here, so 
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that discrete paths in (1.2) turn into Brownian bridges, while d t ^ x turn into the space-time 
white noise. Let us explain in more detail how this works as this will provide a direct link 
to the Kardar-Parisi-Zhang equation that gave the name to the KPZ universality class. 
For a Brownian bridge B = B(s) we obtain a functional 

H(B) = J /3W(s,B(s))ds, (1.3) 

where W is the 2d white noise. Thus, the partition function Z(t,x) has the form 

Z(t,x) = ^exp f-^j E( : exp : (H(B)), (1.4) 

where E is the expectation with respect to the law of the Brownian bridge which starts 
at at time and ends at x at time t, and : exp : is the Wick ordered exponential, 
see |Alberts-Khanin-Quastel-12b| and references therein for more details. Note that the 
randomness coming from the white noise is still there, and Z(t,x) is a random variable. 
Another way of defining Z(t,x) is through the stochastic PDE it satisfies: 

d „, , 1 / d x 2 



m z{t ' x) = 2\ai) z(t ' x) + wz - (L5) 

This is known as the stochastic heat equation. Indeed, if we remove the part with the 



white noise in (1.5), then we end up with the usual heat equation. 
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If the space (corresponding to the variable x) is discrete, then an equation similar to 



(1.5) is known as the parabolic Anderson model; it has been extensively studied for many 



years. 



Note that through our approach the solution of (1.5) with <5 initial condition at time 



is the limit of discrete Z(t,x) of (1.2) and, thus, we know something about it. 



If we now define U through the so-called Hopf-Cole transformation 

Z(x, t) — exp(U(x, t)) 



then, as a corollary of (1.5), U formally satisfies 



d„, , 1 ( d \ 2 _ , f d 



which is the non-linear Kardar-Parisi-Zhang (KPZ) equation introduced in 



Kardar-Parisi-Zhang-86| as a way of understanding the growth of surfaces we started 
with (i.e. ballistic deposition), see |Corwin-TT] for a nice recent survey. 

Due to non-linearity of ( 1.6[ ) it is tricky even to give a meaning to this equation (see, 



however, |Hairer-ll] for a recent progress), but physicists still dealt with it and that's one 
way how the exponent 1/3 of 1 s was predicted. (An earlier way was through dynamical 
renormalization group techniques, see |Forster-Nelson- Stephen- 77| .) 

If we were to characterize the aforementioned results in one phrase, we would use 
"integrable probability". "Integrable" here refers to explicit formulas that can be derived, 
and also hints at parallels with integrable systems. There are direct connections, e.g. y(s) 
defined via 

y 2 (s) = -(\nF 2 (s))" 
solves the (nonlinear) Painleve II differential equation (see |Tracy- Widom-94| ) 

y"(s) = sy(s) + 2y(s) 3 . 

Also if we define 

F(xi, . . . , x n ; t) = E (Z(x lt t) ■ ■ ■ Z(x n , t)) , 



where Z(t,x) is the solution of Stochastic Heat Equation (|1.5|), then 

d 



\i=l x ' i^j / 



where 5 is the Dirac delta-function. (1.7) is known as the evolution equation of the 



quantum delta-Bose gas. It was the second quantum many body system solved via Bethe 
ansatz, see |Lieb-Liniger-63| , |McGuire-64j . 

There is also a deeper analogy: Both integrable systems and integrable probability 
models can be viewed as shadows of representation theory of infinite-dimensional Lie 
groups and algebras. However, while integrable PDEs often represent rather exotic be- 
havior from the point of view of general PDEs, integrable probability delivers universal 
behavior for the whole universality class of similar models. Moreover, in the rare occa- 
sions when the universality can be proved (e.g. in random matrices, see recent reviews 
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|Erdos-Yau-12] . |Tao-Vu-T2] and references therein, or in (1 + l)d polymers in the so- 
called intermediate disorder regime, see |Al berts-Kh anin-Quastel-12a"| ), one shows that 
the generic behavior is the same as in the integrable case. Then the integrable case 
provides the only known route to an explicit description of the answer. 

While we will not explain in these notes the representation theoretic undercurrent in 
any detail, we cannot and do not want to get rid of it completely. In what follows we will 
rely on the theory of symmetric functions which is the algebraic-combinatorial apparatus 
of the representation theory. 
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2 Symmetric functions 

In this section we briefly review certain parts of the theory of symmetric functions. If 
the reader is familiar with the basics of this theory, (s)he might want to skip this section 
returning to it, if necessary, in the future. There are several excellent treatments of 
symmetric functions in the literature, see e.g. |Macdonald-9 5j . |Sa gan-0l| , |Stanley-9~9 



We will mostly follow the notations of [Macdonald-95j and recommend the same book for 
the proofs. 

Our first aim is to define the algebra A of symmetric functions in infinitely many 
variables. Let Ajv = C[xi, . . . , xn] Sn be the space of polynomials in x%, . . . , xn which are 
symmetric with respect to permutations of the Xj. A^r has a natural grading by the total 
degree of a polynomial. 

Let 7TAT+1 : C[xi, . . . , xn+i] — > C[x\, . . . , xn] be the map defined by setting xn+i = 0. 
It preserves the ring of symmetric polynomials and gradings. Thus we obtain a tower of 
graded algebras 

C^A^A^.... 
We define A as the projective limit of the above tower 

A = lim A N = {(/i, / 2 , / 3 , . . . ) | fj e Aj, iTjfj = fj_x, deg(fj) are bounded }. 

An equivalent definition A is as follows: Elements of A are formal power series f(x\,X2, • • • ) 
in infinitely many indeterminates Xi,x 2 , ... of bounded degree that are invariant under 
the permutations of the x^s. In particular, 

Xi + x 2 + x 3 + . . . 

is an element of A, while 

(1+Xi)(l+X 2 )(l + Z3)--- 

is not, because here the degrees are unbounded. 
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Elementary symmetric functions e^, k — 1, 2, . . . are defined by 



ek 



^ ] • • ■ Xi k . 



h<i2<---<ik 

Complete homogeneous functions hk, k = 1, 2, . . . are defined by 



E 



h<i2< — <ik 

Power sums pk, k = 1, 2, . . . are defined by 

i 

Theorem 2.1. The systems {ek}, {hk}, {Pk} are algebraically independent generators of 
A. In other words, A can be seen as the algebra of polynomials in hk, or the algebra of 
polynomials in ek, or the algebra of polynomials inpk-' 

A = C[ei,e 2 ,...] = C[hi,h 2 ,...} = C[pi,p 2 , • • • ]• 

The proof of this statement can be found in [Macdonald-95, Chapter I, Section 2]. 
Theorem 2.1| for the polynomials in finitely many variables is known as the fundamental 
theorem of symmetric polynomials. 

It is convenient to introduce generating functions for the above generators: 

oo oo oo 

H(z):=Y,h k z k , E(z):=Y,e k z k , P(z) := $> fc z fc -\ 



2=0 



2 = 



2=1 



where we agree that ho — eo = 1. 
Proposition 2.2. We have 



i 

E(z) = Y[(l + x i z), 



P(z) 



dz 



Vln^^. 

/ — ' 1 — X;Z 



In particular, 



H{z) 



E(-z) 



exp 



E 

.fe=i 



z Pk 

k 



(2.1) 
(2.2) 

(2.3) 
(2.4) 



Proof. In order to prove (2.2) open the parentheses and compare with the definition of 



efc. To prove (2.1) note that 

II II 



1 + XiZ + xh 2 + xh 3 + . . . 
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and open the parentheses again. Finally, using the power series expansion of the logarithm 

we get 

d v^, 1 d sr^ ( xfz 2 xfz 3 \ 

-r- > hi = — > [XiZ + — h — h . . . 

dz^ I -x^ dz^\ 2 3 J 

oo 

= (Xi + S?Z + X?Z 2 + ...)= ^Pfc^" 1 - 

i k=l 

□ 



Figure 6: Left panel: Young diagram A of size 9 with row lengths (4,3,2) and column 
lengths (3,3,2,1). Right panel: Transposed diagram A'. 

Let A be a Young diagram of size n or, equivalently, a partition of n. In other words, A 
is a sequence Ai > A2 > • • ■ of non-negative integers (which are identified with row lengths 
of the Young diagram), such that Yli A* = |A| = n. The diagram whose row lengths are 
column lengths of A is called transposed diagram and denoted A'. In other words, for each 
i, X' { is equal to the number of j such that Aj > i. We draw Young diagrams as collections 
of unit boxes and Figure [6] gives an example of a Young diagram and its transpose. 

The length £(X) of A is defined as the number of non-zero numbers Aj (equivalently, 
the number of rows in A). Clearly, £(X) = A' x . 

We denote the set of all Young diagrams by Y. By definition Y includes the empty 
partition = (0, 0, . . . ). 

Definition 2.3. The Schur polynomial s\(xi, . . . ,Xn) is a symmetric polynomial in N 
variables parameterized by Young diagram A with £(A) < iV and given by 



det 



X j+ N-j N 

x/ 



s\(x 1 ,...,x N )= n L , — l, \ - ■ (2.5) 

Lli<j\ X i X j) 

One proves that when £(X) < N 

7T N+1 S\(X 1} ...,X N , X N +l) = S\(xi, . . . , X N , 0) = S X (xx, . . . , X N ). 

In addition, 

ne(\)S X (x 1 , . . . ,z/(a)) = 0. 

Therefore, the sequence of symmetric polynomials S\(xi, . . . , x^) with fixed A and varying 
number of variables iV > £(X), complemented by zeros for iV < £(X), defines an element 
of A that one calls the Schur symmetric function s\. By definition s (x) = 1. 
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Proposition 2.4. The Schur functions s\, with X ranging over the set of all Young 
diagrams, form a linear basis of A. They are related to generators and through the 
Jacob i-Trudi formulas: 



s\ = det 



h 



det 



i,j=l,...,£(X) 

where we agree that hk = = for k < 0. 



e A' 



-i+j 



i,j=i,...,i(\>) 



The proof of this statement can be found in [M acdonald-951 Chapter I, Section 3]. 
The expression of s\ through generators pt is more involved and is related to the table 
of characters of irreducible representations of the symmetric groups, see [Ma cdonald^95l 
Chapter I, Section 7]. 

Now suppose that we have two copies of the algebra A or, in other words, two sets 
of variables x = (xi, x 2 , ■ ■ ■ ) and y = (2/1, 2/2, - - • ) • W e can consider functions of the form 
s \( x ) s /i(y), which will be symmetric functions in variables x and y separately, but not 
jointly; formally such function can be viewed as an element of the tensor product A <g> A. 
More generally, we can consider an infinite sum 



^2s x (x)s x (y), 



as an infinite series symmetric in variables Xi,X2,--- and in variables 2/1, 2/2? 



(2.6) 
The 



following theorem gives a neat formula for the sum (2.6). 
Theorem 2.5 (The Cauchy identity). We have 



sx(x u x 2 , . . . )s x (y 1 ,y 2 , ) = JJ 

AeY u iyj 



(2.7) 



and also 

P\(xi,x 2 , 



E 

AeY 



exp 



E 

,fc=i 



p k (x 1 ,x 2 ,. . .)Pk(yi,y2,- ■ ■) 
k 



1 j 



X{Xjj 



(2.1 



where 



Px = P\iPm ■ ■ ■ P\ K x) 
and Z\ = Yli>i i^rn^-, where rrii(X) is the number of rows of length i in A. 



Remark. The right-hand sides of (2.7) and (2.8) should be viewed as formal power series 
via 

1 

1 + XiVj + (xiyj) 2 + (xiyj) 3 + 



1 



1 



Xiyj 



The proof of Theorem 2.5 can be found in |Macdonald-95t Chapter I, Section 4]. In fact, 



(2.7) is a particular case of the more general skew Cauchy identity. We need to introduce 



further notations in order to state it. 
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Take two sets of variables x = (xi,X2, ■ ■ ■) and y = (yi, yn ■ ■ ■ ) and a symmetric 
function / £ A. Let (x, ?/) be the union of sets of variables x and y. Then we can 
view f(x,y) as a function in Xi,yj symmetric with respect to all possible permutations 
of variables. In particular, f(x,y) is a symmetric function in Xi and also a symmetric 
function in yi, more precisely, f(x,y) is a sum of products of symmetric functions of Xi 
and symmetric functions of y^. What does this decomposition look like? The answer, of 
course, depends on /. For instance, 

Pk 0, V) = X i + Y y i = Pk ( X ) + Pk (V) ■ 

i i 

Definition 2.6. Let X be any Young diagram. Expand s\(x,y) as a linear combination 
of Schur symmetric functions in variables y^; the coefficients of this expansion are called 



skew Schur functions and denoted s 



In particular, S\/n(x) is a symmetric function in variables Xi. 

Proposition 2.7 (The skew Cauchy identity). For any Young diagrams X, v we have 

, X2, ■ ■ ■ )s fJi / L/ 

= II 1 - 1 5Z s A/«(?/i> J/2, • • • )s v / K {xx, x 2 ,...). (2.9) 
For the proof of this statement see |Macdonald-95t Chapter I, Section 5, Example 



26]. In order to see that (2.7) is indeed a particular case of (2.9) we need the following 
generalization of Jacobi-Trudi identity (its proof can be found in the same section of 
|Macdonald-"95] l 

Proposition 2.8. Assuming that h k = for k < 0, we have 



det 



h 



Xi—fij—i+j 



i,j=l,...,ma,x(£(X),£(fi)) 

In particular, Sa/ m = unless fi G X, i.e. fa < Aj for all i. 



Comparing Proposition 2.8 with Proposition 2.4 we conclude that if \i = is the 
empty Young diagram, then (one can also see this independently from definitions) 



and also 



' 1, v = 0, 
0, otherwise. 



Now if we set A = v = in (2.9) we get (2.7). 



Another property of the skew Schur functions is summarized in the following proposi- 
tion (its proof can be found in [M acdonald-95t Chapter I, Section 5]). 
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Proposition 2.9. Let x and y be two sets of variables. For any A,/j£Ywe have 

Another important notion is that of a specialization. 

Definition 2.10. Any algebra homomorphism p : A — > C, / t- >■ f(p), is called a spe- 
cialization. In other words, p should satisfy the following properties: 

(f + g)(p) = f(p) + g(p), (fg)(p) = f(p)g( P ), (6f)( P ) = 6f( P ), 6eC. 

Take any sequence of complex numbers w l5 u 2 , . . . satisfying \ u i\ < 00 • Then the 
substitution map A — > C, specialization. More generally, any specialization 

is uniquely determined by its values on any set of generators of A. Furthermore, if the 
generators are algebraically independent, then these values can be any numbers. What 
this means is that defining p is equivalent to specifying the set of numbers pi(p), p2(p), - ■ ■ , 
or the set of numbers e\(p), e2(p),. • • , or the set of numbers hi(p), h 2 (p),. ... In particular, 
if p is the substitution of complex numbers Ui, then 



Note that the condition ^\ \ui\ < 00 implies that the series in (2.10) converges for any 
k > 1. 

Sometimes it is important to know which specializations are positive in a certain sense. 
We call a specialization p Schur-positive if for every Young diagram A we have 

sx{p) > 0. 

There is an explicit classification for Schur-positive specializations. 

Theorem 2.11. The Schur-positive specializations are parameterized by pairs of se- 
quences of non-negative reals a = (a\ > a 2 > ■ ■ ■ > 0) and j3 — > fa > • • ■ > 0) 
satisfying + A) < 00 and an additional parameter 7 > 0. The specialization with 

parameters (a; (3; 7) can be described by its values on power sums 

pi h-> pi(a; (3; 7) = 7 + + A), 

i 

p k h. 0; 7) = + fc > 2 

i 

or, equivalently, via generating functions 

f>(a:;/3; 7)^ = ^7^- 

k=0 i>l 

Remark. One can show that if p is a Schur-positive specialization, then also sa/^(p) > 
for any A, p G Y. 
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Theorem 2.11 has a number of equivalent reformulations, in particular, it is equiv- 
alent to the description of all characters of the infinite symmetric group S(oo) and 
to the classification of totally positive triangular Toeplitz matrices. The first proofs 
of Theorem 2.11 were obtained (independently) by Thoma [T homa-64] and Edrei 
|Edrei-53~] , a proof by a different method can be found in |Vershik-Kerov-8T] . |Kerov-03] . 
[Ke rov-Okounkov-01shanski-98| . and yet another proof is given in |Okounkov-94] . 

Our next goal is to study the simplest Schur-positive specializations more thoroughly. 
Given two Young diagrams A and \x we say that X/fi is a horizontal strip if < A^ — ^ < 1 
for all i; \/ ji is a vertical strip if < \ — //, < 1 for all i. 

A semistandard Young tableau of shape A and rank N is a filling of boxes of A with 
numbers from 1 to N in such a way that the numbers strictly increase along the columns 
and weakly increase along the rows. A standard Young tableau of shape A is a filling of 
boxes of A with numbers from 1 to |A| in such a way that the numbers strictly increase 
both along the columns and along the rows (in particular, this implies that each number 
appears exactly once). Examples of Young tableaux are given in Figure [7J 
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5 


5 


to 


CO 


4 






CO 
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to 


CO 
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6 


8 




7 









Figure 7: Left panel: a semistandard Young tableau of shape (5, 3, 2) and rank 5. Right 
panel: a standard Young tableau of shape (4,3, 1). 

The number of all semistandard Young tableaux of shape A and rank N is denoted as 
Dimjv(A). The number of all standard Young tableau of shape A is denoted as dim(A). 
These quantities have representation-theoretic interpretations, namely, DimTv(A) is the 
dimension of the irreducible representation of unitary group U(N) indexed by (the highest 
weight) A and dim(A) is the dimension of the irreducible representation of symmetric group 
5(| A |) indexed by A. 



The proofs of the following statements are a combination of Theorem 2.11 and results 
of |Macdonald-95l Chapter I]. 

Proposition 2.12. Suppose that a% — c and all other a-, (3-, ^-parameters are zeros. 
Then for Schur-positive specialization (a;/3;j), S\(a;(3;j) = unless X is a one-row 
Young diagram (i.e. £(X) = 1), and, more generally, s>/ At (a; /?; 7) = unless X/fi is a 
horizontal strip. In the latter case 



sa/ m («;/3;7) 



Proposition 2.13. Suppose that {3\ = c and all other a-, (3-, ^-parameters are zeros. 
Then for the Schur-positive specialization (a;/?; 7), sa(«;/3;7) — unless X is a one- 
column Young diagram (i.e. X\ < 1), and, more generally, swj,(a; 0; 7) = unless X/fi is 
a vertical strip. In the latter case 



SA//i(a;/?;7) = c 



IAI-H 
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Proposition 2.14. Suppose that a,\ = ct2 = ■ • ■ = cin — 1 an d all other a-, f3-, 7- 
parameters are zeros. Then for the Schur-positive specialization (a; (3; 7) 

s A (a;/3;7) = Dimjv(A), A G Y. 

Proposition 2.15. Suppose that 7 = and all a- and (3 -parameters are zeros. Then for 
the Schur-positive specialization (a; (3; 7) 

C |A| 

s A (a;/3;7) = — dim(A), A G Y. 



3 Determinantal point processes 

In this section we introduce determinantal point processes which are an important tool in 
the study of growth models and in the KPZ universality class. 

Consider a reasonable "state space" or "one particle space" X, say the real line M, or 
the Euclidean space M. d , or a discrete space such as the set Z of integers or its subset. A 
point configuration X in X is a locally finite (i.e. without accumulation points) collection 
of points of the space X. For our purposes it suffices to assume that the points of X 
are always pairwise distinct. The set of all point configurations in X will be denoted as 
Conf(X). 

A compact subset A C X is called a window. For a window A and X G Conf(X), 
set Na(X) = \A n X\ (number of points of X in the window). Thus, Na is a function 
on Conf(X). We equip Conf(X) with the Borel structure (i.e. cr-algebra) generated by 
functions Na for all windows A. 

A random point process on X is a probability measure on Conf(X). We will often use 
the term particles for the elements of a random point configuration. Thus, we will speak 
about particle configurations. 

The most known example of a random point process is the homogeneous (rate 1) 
Poisson process on M. For any finite interval A C R (or, more generally, for any compact 
set A), the number Na of particles falling in A is finite because, by the very assumption, 
X has no accumulation points. Since X is random, Na is random, too. Here are the key 
properties of the Poisson process (see e.g. |Billingsley-95, Section 23] or |Kallenberg-02 , 
Chapter 10]): 

• iV^ has the Poisson distribution with parameter \A\, the length of A. That is 

I A \n 

F(N A = n) = e HA| M", n = 0,l,2,... 
n\ 

• If Ai,. . . ,Ak are pairwise disjoint intervals, then the corresponding random variables 
Nau ■ ■ ■ , NA k are independent. This means that the particles do not interact. 

The Poisson process can be constructed as follows. Let M — 1, 2, 3, . . . be a natural 
number. Take the interval [— M/2, M/2] and place M particles in it, uniformly and 
independently of each other. Observe that the mean density of the particles is equal to 
1 for any M because the number of particles and the length of the interval are the same. 
Now pass to the limit as M — > 00. As M gets large, the interval approximates the whole 
real line, and in the limit one obtains the Poisson random configuration. 
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Exercise 3.1. Assuming that the limit process exists, show that it satisfies the above two 
properties concerning the random variables Na- 

The above simple construction contains two important ideas: First, the idea of limit 
transition. Starting from M-particle random configurations one can get infinite particle 
configurations by taking a limit. Second, the observation that the limit transition may 
lead to a simplification. Indeed, the structure of the joint distribution of N^, ■ ■ ■ ,NA k 
simplifies in the limit. 

Let us construct a discrete analog of the Poisson process. Replace the real line R by 
the lattice Z of integers. This will be our new state space. A particle configuration on Z 
is simply an arbitrary subset X C Z, the assumption of absence of accumulation points 
holds automatically. 

Fix a real number p G (0, 1). The stationary Bernoulli process with parameter p on 
Z is constructed as follows: For each integer n G Z we put a particle at the node n with 
probability p, independently of other nodes. This procedure leads to a random particle 
configuration. 

Equivalently, the Bernoulli process is a doubly infinite sequence £„, n G Z, of binary 
random variables, such that each £ n takes value 1 or with probability p or 1 — p, respec- 
tively, and, moreover, these variables are independent. Then the random configuration X 
consists of those n's for which = 1. 

The following construction is a simple example of a scaling limit transition. Shrink our 
lattice by the factor of p. That is, instead of Z consider the isomorphic lattice p Z C R with 
mesh p, and transfer the Bernoulli process to p Z. The resulted scaled Bernoulli process 
can be regarded as a process on R because p Z is contained in R, and each configuration 
on pZ is simultaneously a configuration on R. As p goes to 0, the scaled Bernoulli process 
will approximate the Poisson process on the line. This is intuitively clear, because for 
Bernoulli, like Poisson, there is no interaction, and the mean density of particles for the 
scaled Bernoulli process is equal to 1. 

How to describe a point process? The problem here comes from the fact that the space 
of particle configurations Conf(X) is, typically, infinite-dimensional and, thus, there is no 
natural "Lebesgue measure" which could be used for writing densities. One solution is to 
use correlation functions. 

Let us temporarily restrict ourselves to point processes on a finite or countable discrete 
space X (for instance, the reader may assume X = Z). Such a process is the same as a 
collection {£ x } of binary random variables, indexed by elements i£l, which indicate the 
presence of a particle at x. (They are often called occupancy variables.) The Bernoulli 
process was a simple example. Now we no longer assume that these variables are indepen- 
dent and identically distributed; they may have an arbitrary law. Their law is simply an 
arbitrary probability measure P on the space of all configurations. This is a large space, 
it can be described as the infinite product space {0, Thus, defining a point process 
on X amounts to specifying a probability measure P on {0, 1} X . 

Definition 3.2. Let A range over finite subsets ofX. The correlation function of a point 
process X on X is the function p(A) defined by 

p(A) = F(A C X). 
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If A has n points, A = {xi, . . . ,x n }, then we also employ the alternative notation 

p(A) = p n (xi, . . .,x n ). 

In this notation, the single function p splits into a sequence of functions p\, P2, ■ ■ ■ , where 
p n is a symmetric function in n distinct arguments from X, called the n-point correlation 
function. 

Equivalently, in terms of occupation random variables {£ x } 

p n (x U ...,!„)= P(^i = • • • = = !)■ 

For example, for the Bernoulli process one easily sees that p(A) = pl A L 

Exercise 3.3. Show that a random point process on a discrete set X is uniquely determined 
by its correlation function. 

Let us now extend the definition of the correlation functions to arbitrary, not neces- 
sarily discrete state spaces X. 

Given a random point process on X, one can usually define a sequence {p n }n°=iJ where 
p n is a symmetric measure on X n called the nth correlation measure. Under mild con- 
ditions on the point process, the correlation measures exist and determine the process 
uniquely. 

The correlation measures are characterized by the following property: For any n > 1 
and a compactly supported bounded Borel function / on X n , one has 

/ fp n = E I f( X h,---, X in) 
JX " \r V / 

where E denotes averaging with respect to our point process, and the sum on the right is 
taken over all n-tuples of pairwise distinct points of the random point configuration X. 
In particular, for n — 1 we have 

Jj(x) Pl (dx)=E(j^f(x)j 

and pi is often called the density measure of a point process. 

Often one has a natural measure p on X (called reference measure) such that the 
correlation measures have densities with respect to u® n , n — 1,2, ... . Then the density 
of p n is called the nth correlation function and it is usually denoted by the same symbol 

Pn- 

Exercise 3.4. Show that ifX is discrete, then the previous definition of the nth correlation 
function coincides with this one provided that p is the counting measure (i.e. p assigns 
weight 1 to every point of X). 

Exercise 3.5. Show that the nth correlation function of homogeneous rate 1 Poisson 
process (with respect to Lebesgue measure) is identically equal to 1: 

Pn = ^, n>l. 
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If X C R and p is absolutely continuous with respect to the Lebesgue measure, then the 
probabilistic meaning of the nth correlation function is that of the density of probability 
to find a particle in each of the infinitesimal intervals around points x±, x 2 , ■ ■ ■ x n : 

p n {x u x 2 ,... x n )/i(cfei) • • • p(dx n ) 

= P (there is a particle in each interval (xi, Xi + dxj)). 

For a random point process with fixed number of particles, say N, described by a joint 
probability distribution P(dxi, . . . , dx^) (it is natural to assume that P is symmetric with 
respect to permutations of the arguments), the correlation measures for n < N are given 
by 



Pn(dX!, . . .,dx n ) = 



(N-n) 



P(dx±, . . . , dxw)- 



(3.1) 



x„+i,...,xn£X 



Indeed, we have 
/ 



E 



pairwise distinct 



')-/ 

I JX N 

/ 



f(x h ,...,x in )P(dx) 



pairwise distinct 



(N-n)\ J x n 



T / f(x u ...,x n )P(dx). 

! J x n 



For n > N the correlation measures p n vanish identically. 

Another important property of the correlation measures is obtained by considering the 
test functions / given by products of characteristic functions of a window A C X. Then 
one immediately obtains 

E(N A (N A - 1) ■ ■ ■ (N A - n + 1)) = [ p n {dx u ...,dx n ). 

Definition 3.6. A point process on X is said to be determinantal if there exists a function 
K(x,y) on X x X such that the correlation functions (with respect to some reference 
measure) are given by the determinantal formula 

p n (x u ...,x n ) = det[K(xi, Xj)\ n ij=1 

for all n = 1,2, ... . The function K is called the correlation kernel. 

That is, 



Pi(xi) = K(x 1 ,x 1 ), p 2 (x 1 ,x 2 ) = det 
p 3 (x 1 ,x 2 ,x 3 ) = det 



K(xi,x 1 ) K(x 1 ,x 2 ) 
K(x 2 ,x 1 ) K(x 2 ,x 2 ) 



K(x 1 ,x 1 ) K(xi,x 2 ) K(x 1 ,x 3 ), 
K(x 2 ,x 1 ) K(x 2 ,x 2 ) K(x 2 ,x 3 ) 
K(x 3 ,x 1 ) K(x 3 ,x 2 ) K(x 3 ,x 3 ) 



etc. 



Note that the determinants in the right-hand side do not depend on the ordering of 
the arguments Xi. Also the correlation kernel is not unique — gauge transformations of 
the form 

K(x,y)^^K(x,y) 

f{y) 
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with a non-vanishing / : X — > C do not affect the correlation functions. 

The correlation kernel is a single function of two variables while the correlation func- 
tions form an infinite sequence of functions of growing number of variables. Thus, if a 
point process happens to be determinantal, it can be described by a substantially re- 
duced amount of data. This is somehow similar to Gaussian processes, for which all the 
information about process is encoded in a single covariance function. 

Exercise 3.7. Prove that the stationary Poisson process on K and the Bernoulli process 
are determinantal. What are their correlation kernels? 

Determinantal processes appeared in the 60s in the context of random matrix theory 
with first example going back to the work of Dyson |Dyson-62 . As a class such processes 



were first distinguished in 1975 by Macchi |Macchi-75] . who considered the case when 
correlation kernel is Hermitian (K(x,y) = K(y,x)) and called them "Fermion processes". 
In this case 

P2(x,y) = p 1 {x)p 1 {y) - \K(x,y)\ 2 < pi(x)pi(y) 

which, from a probabilistic point of view, means that particles repel. 

The term "determinantal point process" first appeared in |Borodin- Ulshanski-00] to- 
gether with first natural examples of such processes with non-Hermitian kernels. Now 
this term is widespread and there is even a wikipedia article with the same name. 

Nowadays lots of sources of determinantal point processes are known in- 
cluding (in addition to the random matrix theory) dimers on bipartite graphs 
|Kenyon-09| , uniform spanning trees |Lyons-03] , ensembles of non-intersecting paths 
on planar acyclic graphs |Johansson-02| and zeros of random analytic functions 
Hough-Krishnapur-Virag-Peres-lOl - A recent review of determinantal point processes 



can be found in [Bo rodin-10j . 

An important class of determinantal random point processes is formed by biorthogonal 
ensembles. 

Definition 3.8. Consider a state space X with a reference measure \i. An N -point 
biorthogonal ensemble on X is a probability measure on N -point subsets {x\, . . . ,Xn} of 
X of the form 

P N (dxi, . . . , dx N ) = c N ■ det [(f>i{xj)) iJ=1 det [ipi{xj)] i)j=1 ■ f-t(dxi) • • • n{dx N ) 

for a normalization constant c^ > and functions (j>i,il>i, ■ . . on X such that all 

integrals of the form j x (f)i{x)ipj{x)p,{dx) are finite. 

Important examples of biorthogonal ensembles come from the random matrix theory. 
One case is the measure on iV-particle configurations on the unit circle with density 
proportional to 



n 



X% Xj I , 



i<j 

which can be identified with the distribution of eigenvalues of the N x N random unitary 
matrices; here we equip the unitary group U(N) with the Haar measure of total mass 1. 

Another case is the measure on iV-particle configurations on R with density propor- 
tional to 

i<j i 
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which is the distribution of eigenvalues of a random Hermitian matrix from Gaussian Uni- 
tary Ensemble (GUE), see |Mehta-0l] . [Forrester- 10| . [Anderson- Guionnet-Zeitouni- 1 0| . 
[Ak emann-Baik- Francesco- 11] . 

Theorem 3.9. Any biorthogonal ensemble is a determinants point process. Its correla- 
tion kernel has the form 



N 

IJ1 



K(x,y) = Y J H^Ay)\G- t 



where G = [Gij]fj =1 is the Gram matrix: 

Gij = / 4> i (x)tlj j (x)n(dx). 



Remark. When the functions (pi(x) and ipjiv) are biorthogonal, the Gram matrix is diag- 
onal and it can be easily inverted. This is the origin of the term "biorthogonal ensemble". 



Proof of Theorem 3.£\ Observe that 



N i x r / / \iN 

aei [<Pi\Xj)\ 

N 



det [<t>i{ x j)]?,j=i det bPi( x j)]ij=i ' K^i) ' ' ' Kdx N ) 



•N 



N 

= Y[ G a{i)T{j) = N\ det G. 

a,r£S(N) i=l 

This implies that the normalization constant cn in the definition of the biorthogonal 
ensemble above is equal to (AH det and that the matrix G is invertible. 

We need to prove that p n (xi, . . . , x n ) = det[K(xi, ^j)]?j=i f° r an y ti> 1. For n > N 
the statement is trivial as both sides vanish (the right-hand side vanishes because the 
matrix under determinant has rank no more than A" due to the explicit formula for K). 



Assume n < N. By formula (3.1) for the correlation functions, 

PniS^li • • ■ i X ri) 



N\c N 



det [0i(^-)]ff=i det bPi{ x j))u=i p(dx n+1 ) ■ ■ ■ p(dx N ). 



(N-n)\ J X N- n v J"hJ=i --^^-"hj 

Let A and B be two N x N matrices such that AGB 1 = Id. Set 

N N 



<5> k = J2 A ki<Pi, y k = J2 Bkl ^ l > k = l,...,N. 



k=l k=l 

Then 
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On the other hand, for any xi, . . . , x^, 

det [<$>i{ x j)]? >j= i = det A det [(f>i{ x j)}?,j=i , 
det = det B det fofcfo)]£ =1 . 

Also AGB 1 = Id implies det A det B det G = 1. Hence, the formula for the correlation 
functions can be rewritten as 



Pn \p^l j • • • j 



1 



(JV-n)! 



det [$i(xj)]. =1 det . 1 //(dx n+ i) ■ ■ ■ fi{dx N ). 



Opening up the determinants and using the fact that $j's and ^'s are biorthogonal, we 
obtain 

1 - 

p n (xi,...,x„) = — — — y sgn(crr) JJ$ CTW (x i )^ r(i )(x i ) 

^ '' a,reS(N) 1=1 

a-(fc)=r(fc), fc>n 

^ del <I>' J " (let M/" '• . 

1<ji<-<j„<JV 

where "J)- 71 '••• Jn is the submatrix of $ := i=i,..., n formed by columns j\, . . . ,j n , and 

j=l,...,n 

similarly for \[rii)-jn. The Cauchy-Binet formula now yields p n (xi, . . . ,x n ) = det $ , I/ t ) 
and 

AT AT 
k,l,m=l l,m=l 

The right-hand side is equal to K(xi,Xj) because AGB 1 = Id is equivalent to A l B = 

G-K ' ' □ 

The appearance of biorthogonal ensembles in applications is often explained by the 
combinatorial statement known as the Lindstrom-Gessel-Viennot (LGV) theorem, see 
Stembridge-90 and references therein, that we now describe. 



Consider a finite^ directed acyclic graph and denote by V and E the sets of its vertices 
and edges. Let w : E — > C be an arbitrary weight function. For any path 7r denote by 
io(tt) the product of weights over the edges in the path: w(tt) = Yl eew w(e). Define the 
weight of a collection of paths as the product of weights of the paths in the collection (we 
will use the same letter w to denote it). We say that two paths 7Ti and tt 2 do not intersect 
(notation 7i"i fl 7r 2 = 0) if they have no common vertices. 

For any u, v G V, let H(u, v) be the set of all (directed) paths from u to v. Set 

T(u,v)= Yl W W- ( 3 - 2 ) 



lr Thc assumption of finiteness is not necessary as long as the sums in (3.2 1 converge 
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Theorem 3.10. Let (u\, . . . , u n ) and (v i, . . . , v n ) be two n-tuples of vertices of our graph, 
and assume that for any nonidentical permutation a G S(n), 



Then 



{(tTi, . . . ,7T n ) | TTi e II (ui,V a ^) , TXi fl 7Tj = 0, Z,j 



n 



} 



w(vri, . . . , vr n ) = det [TO*, Vj)]" 



7rieII(Mi,?;i),...,7r n eII(it n ,v n ) 



0. 



Theorem 3.10 means that if, in a suitable weighted oriented graph, we have nonin- 
tersecting paths with fixed starting and ending vertices, distributed according to their 
weights, then the distribution of the intersection points of these paths with any chosen 



"section" has the same structure as in Definition |3.8[ and thus by Theorem |3.9| we obtain 
a determinantal point process. More generally, the distribution of the intersection points 
of paths with finitely many distinct "sections" also form a determinantal point process. 
The latter statement is known as the Eynard-Metha theorem, see e.g. |Borodin-Rains-05] 
and references therein. 



A continuous time analog of Theorem 3.10 goes back to |Karlin-Mcgregor-59| , who in 
particular proved the following statement (the next paragraph is essentially a quotation). 

Consider a stationary stochastic process whose state space is an interval on the ex- 
tended real line. Assume that the process has strong Markov property and that its paths 
are continuous everywhere. Take n points X\ < ■ ■ ■ < x n and n Borel sets E\ < • • • < E n , 
and suppose n labeled particles start at x±, . . . ,x n and execute the process simultane- 
ously and independently. Then the determinant det [P t (xi, Ej)]^ =1 , with P t (x,E) being 
the transition probability of the process, is equal to the probability that at time t the 
particles will be found in sets Ei, . . . , E n , respectively, without any of them ever having 
been coincident in the intervening time. 



Similarly to Theorem 3.10, this statement coupled with Theorem 3.9 (or more gener- 
ally, with Eynard-Metha theorem) leads to determinantal processes. 



4 From Last Passage Percolation to Plancherel measure 

The aim of this section is to connect the Last Passage Percolation with certain probability 
measures on the set of Young diagrams often referred to as Plancherel measures for the 
symmetric groups. 



We start from the Poisson process in the first quadrant, as in Theorem L3 and Figure 
§ but now we rotate the quadrant by 45 degrees, like in Figure |4j There is a graphical 
way to find the value of L(6). Namely, for each point of the process draw two rays starting 
from it and parallel to the axes. Extend each ray till the first intersection with another 
ray. In this way, we get a collection of broken lines, as shown in Figure [8j At the first 
intersection points of the rays we put new points that form the second generation. Note 
now that L(6) is equal to the number of broken lines separating (6,9) and the origin. 
As it turns out, it is beneficial to iterate this process. We erase all the points of the 
original Poisson process, but keep the points of the second generation and draw broken 
lines joining them; we repeat this until no points inside the square with vertices (0,0), 
(0, 6), (6, 0), and (6, 6) are left, as shown in Figure |9j Compute the number of broken lines 



separating (6, 6) and (0, 0) at each step and record these numbers to form a Young diagram 
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Figure 8: Points of Poisson process (in black), broken lines joining them and points of the 
second generation (in light blue). Maximum number of points collected along monotonous 
paths joining points (9, 9) (the big green point) and (0, 0) coincides with number of broken 
lines separating them, which is 4 in our case. Only the points inside dashed square matter. 



X(9) = (Xi(6), \2(9), . . . ), so that, in particular, \i(9) = L(8). Observe that \X(9)\ equals 
the number of points of the original Poisson process inside the square with vertices (0,0), 
(0, 9), (6, 0), and (9, 9). The procedure we just described is known as Viennot's geometric 
construction of the Robinson-Schensted correspondence, see e.g. |Sagan-0T . 



Our interest in this construction is based on the fact that the distribution of \(9) can 
be fairly easily computed, as opposed to that of L(9) = Xi(9). 

Theorem 4.1. The distribution of X(6) is given by the Poissonized Plancherel measure 



P(A(9) = jx) 



-^'y) 2 , fi EY. (4.1) 



Sketch of the proof. Note that X(9) depends only on the relative order of coordinates of 
the points inside the square with vertices (0,0), (#,0), (0,6), (9,9), but not on their 
positions. This order can be encoded by a permutation a (9) which is the permutation 
between the points ordered by x coordinate and by y coordinate. Observe that the size 
n(9) of cr(9), i.e. the number of points inside the square, has Poisson distribution with 
parameter 9 2 , and given this number, the distribution of a (9) is uniform. 

Next, we need to use the Robinson-Schensted-Knuth (RSK) correspondence (or rather 
its earlier Robinson-Schensted version), which is an explicit bijection between permuta- 
tions of size n and pairs of standard Young tableaux of same shape. We refer the reader 
to |Sagan-0T , |Fulton-97] . |Knuth-73] for the definition and properties of this correspon- 



dence. What is important for us is that \(9) is precisely the common shape of two 
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Figure 9: Left panel: Points of second generation (in light blue), broken lines joining 
them and points of the third generation (in dark red). Right panel: Points of the third 
generation (in dark red), broken lines joining them and points of the fourth generation 
(in purple). 



obtained tableau. Therefore, given that the size n(6) of cr(6) is m, the number of boxes 
in A (6) is also m, and the conditional distribution of X(6) is 

i (q\ \ dim V) 
= /i | n{6) = m) = : — . 

ml 



Taking into account the Poisson distribution on m, we arrive at (4.1). □ 



Remark. The fact that the RSK correspondence is a bijection implies that 

dim 2 (A) = n\. (4.2) 

[A|=n 

On the other hand, if we recall the definition of dim(A) as the dimension of irreducible 
representation of the symmetric group S(n), then, taking into the account that ^(n)! = 



n\, the equality (4.2) is nothing else but the celebrated Burnside identity which says that 
squares of the dimensions of irreducible complex representations of any finite group sum 
up to the number of the elements in the group. 

Let us now suggest some intuition on why the asymptotic behavior of L(6) should be 
related to those of growth models and, in particular, to the ballistic deposition that we 
started with in Section [Tj Introduce coordinates (z, t) in Figure [8] so that t is the vertical 
coordinate, and consider the following growth model. At time t the height profile is given 
by an integer-valued (random) function h(x,t), at time zero h(x, 0) = 0. At any given 
time t and any point x, the left and right x-limits of the function h(x,t) differ at most 
by 1, in other words, h(x,t) is almost surely a step function with steps (+1) ("up step", 
when we read the values of the function from left to right) and (—1) ("down step"). If 
there is a point of the Poisson process (of Figure [8]) at (z, t), then at time t a seed is born 
at position x — z, which is combination of up and down steps, i.e. h(z,t) increases by 1. 
After that the down step starts moving with speed 1 to the right, while the up step starts 
moving to the left with the same speed. When the next seed is born, another up and down 
steps appear and also start moving. When up and down steps (born by different seeds) 
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meet each other, they disappear. This model is known as Polynuclear Growth (PNG), 
see |Meakin-98j . |Prahofer-Spohn-T)0 , a very nice computer simulation for it is available 
at Ferrari's website [F errari] . and at Figure 10 we show one possible height function. 



* 

Figure 10: Height function of PNG model after the birth of 3 seeds. 

Coming back to Figure |8j note that its broken lines symbolize the space-time trajec- 
tories of up/down steps, while second generation points are identified with collisions of 
up and down steps. In particular, the positions of up and down steps at time t = to are 
the points of intersection of the line t = t at Figure [8] with parts of broken lines of slope 
(— 1) and (+1), respectively. Now it is easy to prove that the PNG-height h(0,t) at time 
t and point is precisely the Last Passage Percolation Time L(t). In order to observe the 
full Young diagram X(8) one should introduce multi-layer PNG model, where a seed on 
level k, k > 2, is born when the up and down steps collide at level k — 1, and the position 
of seed coincides with the position of collision, see |Prahofer-Spohn-02| for details. 

The PNG model is in the KPZ universality class, and obtaining information on its 
asymptotic behavior (roughness of interface, fluctuation distributions) would give us sim- 
ilar (although conjectural) statements for other members of the same universality class. 



5 The Schur measures and their asymptotic behavior 

The aim of this section is to show how the asymptotic behavior of the Poissonized 
Plancherel measure and certain more general distributions on Young diagrams can be 
analyzed. 

Take any two Schur-positive specializations pi, p 2 of the algebra of symmetric functions 
A (those were classified in Theorem 2.11). The following definition first appeared in 
|Ukounkov-0T] . 

Definition 5.1. The Schur measure S pi;P2 is a probability measure on the set of all Young 
diagrams defined through 

PupA } ~ H( Pl ;p 2 ) ' 
where the normalizing constant H(pi,p 2 ) is given by 

H{pi;p 2 ) =expH - 



k 



Remark. The above definition makes sense only if p 1; p 2 are such that 

^2s x (p 1 )sx(p 2 ) < oo, (5.1) 
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and in the latter case this sum equals H(pi,p 2 ), as follows from Theorem 2.5 The 



convergence of (5.1) is guaranteed, for instance, if \pk{pi)\ < Cr k and \pk{p2)\ < Cr k with 



some constants C > and < r < 1. In what follows we assume that this (or a similar) 
condition is always satisfied. 



Proposition 5.2. Let 

rameter 7 = 9, i.e. 



be the (Schur-positive) specialization with single non-zero pa- 
Pi{pe) = 0, Pk(pe) = 0, k > 1. 



Then P pe pe is the Poissonized Plancherel measure (4.1) 



Proof. This is an immediate corollary of Proposition 2.15 



□ 



Our next goal is to show that any Schur measure is a determinantal point process. 
Given a Young diagram A, we associate to it a point configuration X(X) = {Aj — 2 + 1/2} C 
Z + 1/2. This is similar to the correspondence shown in Figures [2j [3] Note that X(\) is 
semi-infinite, i.e. there are finitely many points to the right of the origin, but almost all 
points to the left of the origin belong to X(X). 



Theorem 5.3 ( [Q kounkov-01] ) . Suppose that the A G Y is distributed according to the 
Schur measure S> Pi;P2 . Then X(X) is a determinantal point process on Z + 1/2 with corre- 
lation kernel K(i,j) defined by the generating series 



K(i,j)v l w j 



H(p 1 ;v)H(p 2 ;w- 1 ) 
H{p 2 -v^)H{ Pl -w) 



E 



h— I 3 5 

" 2 ' 2 ' 2 ' 



^ k 



(5.2) 



where 



H{p\ z) = s ^h k (p)z k = exp ^2pk(p) : 



k=0 



Remark 1. If we expand if -functions in the right-hand side of (5.2) into power series 



and multiply the resulting expressions, then (5.2) can be viewed as a formal identity of 
power series. 



Remark 2. There is also an analytical point of view on (5.2). Using the fact that (under 



suitable convergence conditions) the contour integral around zero 




a k z 



dz 



k=—oo 



is equal to a n and also that when \w\ < \v\ we have 



E 



w \ ' 



'vw 



V J v — w ' 



we can rewrite (5.2) as 



K(i,j) 



H(pi,v)H(p2]w *) -Jvw dvdw 
H{p2, v~ l )H{px, w) v — w v t+1 w~i +1 



(5.3) 
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with integration going over two circles around the origin \w\ = Ri, \v\ = R 2 such that 
Ri < R 2 and the functions H(pi\ it), H(p 2 ; are holomorphic in the annulus Rx — e < 
\u\ < R2 + e. In particular, if \pk{pi)\ < Cr k and \pk{p2)\ < Cr k with some constants 
C > and < r < 1, then any r < R\ < R2 < r _1 are suitable. 



We now present a proof of Theorem 5.3 which is due to Johansson [Johans son-Olb] . 
see also |Okounkov-01| for the original proof. 



Proof of Theorem 5J3, We have to prove that for any finite set A = {a±, . . . , a m } C Z+l/2 
we have 

ES\(pi)s x (p2) , . fV rf snm 

A:AcX(A) 

For this it suffices to prove the following formal identity of power series. Let x 
and y = (yi, . . . ) be two sets of variables; then 



Or!,...) 



E 



s\{x)s\(y) 



X:ACX(X) lll< -? 



EL 



det 



J i,j=l 



(5.4) 



where the generating function of K(i,j) is similar to that of K(i,j) but with p\ and 
P2 replaced by x and y, respectively. One shows (we omit a justification here) that it 



is enough to prove (5.4) for arbitrary finite sets of variables x and y, so let us prove it 
for x = (xi, . . . , xjsr), y = (yi, . . . ,1/at). In the latter case the Schur functions s\(x) are 
non-zero only if £(X) < N. Because of that it is more convenient to work with a finite 
point configuration X^(A) = {Xj + N — j}f = i C Z> which differs from X(X) in two ways. 
First there is a deterministic shift by iV— 1/2, this has an evident effect on the correlation 
functions. Second, X(X) is infinite, while Xn(X) is finite. However, the additional points 
of X(X) (as compared to those of X^(A)) are deterministically located and move away to 
—00 and, therefore, they do not affect the correlation functions in the end. 



The definition of Schur functions (2.5) implies that 



sxjxi, . . .,x N )s\(y!, ...,y N ) = 1_ 



det 



x ■ 



Xj+N-j 



1 N 



det 



y. 



Xj+N-j 



1 N 



(5.5) 



Now we recognize a biorthogonal ensemble in the right-hand side of (5.5). Therefore, 



we can use Theorem 3.9 which yields that Xn(X) is a determinantal point process with 
correlation kernel 



N 



where G~f is the inverse-transpose matrix of the N x N Gram matrix 



G, 



if j 



l>0 



1 ^iVj 



We can compute the determinant of G, which is 



det[G 



N 

l 3H,j=l 



det 



1 



X iVj J ij=l 



N 



n 



N ( 



(5.6) 
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This is known as the Cauchy determinant evaluation. One can prove (5.6) directly, see 
e.g. |Krattenthaler-99] . Another way is to recall that in the proof of Theorem 3.9 we 



showed that the determinant of G is the normalization constant of the measure, and we 
know the normalization constant from the very definition of the Schur measure, i.e. by 
the Cauchy identity. 

By the Cramer's rule, we have 



(G- 



k,i 



det[G 



ij\i^k,j^£ 



det[G 



vk,j=i 



Using the fact that submatrices of Gy are matrices of the same type and their determinants 
can be evaluated using (5.6), we get 

(G l )k,i 

We claim that 

#(4,4 



1 - x k yi) Uj^ k ( x k ~ Xj) Uj^tiVj ~ yd 



N 



£*?.'/, l G 



3 ij 



(27d) 



N M 



zy k )(l- vx k ) z h v h 



fe=i 



(z-x k )(v-y k ) l-zv 



dzdv 



(5.7) 
y k , but do 



with contours chosen so that they enclose the singularities at z = x k and v 
not enclose the singularity at zv = 1. Indeed, (5.7) is just the evaluation of the double 

\jw and shifting 4,4 by 

□ 



integral as the sum of the residues. Changing the variables z 



N — 1/2 in (5.7) we arrive at (5.3). 



Applying Theorem 5.3 to the Poissonized Plancherel measure we obtain 

Corollary 5.4 ( |Borodin-Okounkov-Olshanski-00] . [Johansson- 1 aj ) . Suppose that A is 
a random Young diagram distributed by the Poissonized Plancherel measure. Then the 
points of X(X) form a determinantal point process on Z + | with correlation kernel 



Ka 



hi) 



(2rri) 



exp I 9(v 



,-i 



w + w 



-1- 



vw 



dvdw 



with integration over positively oriented simple contours enclosing zero and such that 
\w\ < \v\. 

Our next aim is to study the behavior of Kg(i,j) as 9 — > oo. The argument 
below is due to Okounkov |Okounkov-03| . but the results were obtained earlier in 
|Borodin-Okounkov-Olshanski-00| . |Johansson-01a] by different tools. Let us start from 
the case % = j. Then K(i,i) is the density of particles of our point process or, looking at 
Figure [3j the average local slope of the (rotated) Young diagram. Intuitively, one expects 
to see some non-trivial behavior when % is of order 9. To see that set i — u9. Then Kg 
transforms into 



K e (u9,u9) 



1 



(27ri) 



with 



S(z) 



exp 0(S(v) - S(w)) 



1 — u In z. 



vw dvdw 



w vw 



(5- 



z — z 
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Our next aim is to deform the contours of integration so that 9R(S(i>) — S(w)) < on 
them. (It is ok if dt(S(v) — S(w)) = at finitely many points.) If we manage to do that, 



then (5.8) would decay as 9 — > oo. Let us try to do this. First, compute the critical points 
i.e. roots of its derivative 



of S(z) 



S'( 



l + z- 



uz 



When \u\ < 2 the equation S'(z) 
1 which we denote e u * > . Here < 



= has two complex conjugated roots of absolute value 
satisfies 2 cos(0) = u. Let us deform the contours so 

We 



that both of them pass through the critical points and look as shown at Figure 11 




Figure 11: Deformed contours: w-contour in blue and u>-contour in green. The dashed 
contour is the unit circle and the black dots indicate the critical points of S(z). 



claim that now 3lS(v) < everywhere on its contour except at critical points e ±4< ^, and 
dtS(w) > everywhere on its contour except at critical points e ±% ^ (JR,S(v) = dtS(w) = 
ate**.) To prove that observe that ^ftS(z) = for z on the unit circle \z\ = 1 and compute 
the gradient of ^ftS(z) = diS(a + hi) on the unit circle (i.e. when a 2 + b 2 = 1): 

MS(a + 6i) = a - - % ln(a 2 + b 2 ), 



V^S(a + bi) -- 
= (1 



1 - 



au 



2ab 



bu 



a 2 + b 2 ) 2 a 2 + b 2 ' (a 2 + b 2 ) 2 a 2 + b 2 
b 2 + a 2 — au , 2ab — bu) = (2a 2 — au , 2ab — bu) 



(2a-u)(a,b). (5.9) 



Identity (|5.9|) implies that the gradient vanishes at points e ^, points outwards the unit 



circle on the right arc joining the critical points and points inwards on the left arc. This 
implies our inequalities for ^tS(z) on the contours. (We assume that the contours are 
fairly close to the unit circle so that the gradient argument works.) 
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Now it follows that after the deformation of the contours the integral vanishes as 
9 — > oo. Does this mean that the correlation functions also vanish? Actually, no. The 
reason is that the integrand in (5.8) has a singularity at v = w. Therefore, when we 
deform the contours from the contour configuration with \w\ < \v\, as we had in Corollary 
5.4, to the contours of Figure 11 we get a residue of the integrand in (5.8) at z = w along 
the arc of the unit circle joining e ±% ^ . This residue is 

i r e * dz _ 

2ni J e -z<t> z 7r 
We conclude that if u = 2cos(0) with < 4> < it, then 



lim Kg(u9, u6 

8— >oo 
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Turning to the original picture we see that the asymptotic density of particles at point 
% changes from when i m 29 to 1 when % —29. This means that after rescaling 
by the factor 9~ 1 times the Plancherel-random Young diagram asymptotically looks like 
in Figure 12 This is a manifestation of the Vershik-Kerov-Logan-Shepp limit shape 



theorem, see |Vershik-Kerov-77] . |Logan-Shepp- 77 . 




Figure 12: The celebrated Vershik-Kerov-Logan-Shepp curve as a limit shape for the 
Plancherel random Young diagrams. 



More generally, what happens with Kg(i,j) when % = u9 + x, j = u9 + y and 9 — > oo? 
In other words, we want to study how the point configuration (or the boundary of the 
random Young diagram A) behaves in the limit locally near a "bulk point". One proves 
the following theorem. 

Theorem 5.5 ( [B orodin-Okounkov-Olshanski-00] ) . For any —2<u<2 and any two 
integers x, y we have 

sin(0(a; - y)) 

lim K e ([u9\ + x, [u9\ + y) = { M 71 ( x ~ V) (5.10) 

otherwise, 



9— >oo 



TT 



where = arccos(w/2). 
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Remark. The right-hand side of (5.10) is known as the discrete sine kernel and it is 



similar to the continuous sine kernel which arises as a universal local limit of correlation 
functions for eigenvalues of random Hermitian (Wigner) matrices, see e.g. |Erdos-Yau-12| . 
|Tao-Vu-l~2] and references therein. 



Proof of Theorem 5.5. The whole argument remains the same as in the case x — y — 0, 



except for the computation of the residue which is now 



1 

27ri 



,i<t> 



dz sin(0(x — y)) 



^ z x ~y+ 1 tt(x — y) 

We conclude that if u = 2cos(0) with < (p < ir, then 

-rs /i „i , \ sm(d)(x — y)) 

hm K e {[u9\ + x, [u9\ + y = — ^ 

e^oo ' 7r(x — y) 



(5.11) 



□ 



So far we got some understanding on what's happening in the bulk, while we started 
with the Last Passage Percolation which is related to the so-called edge asymptotic be- 
havior, i.e. limit fluctuations of Ai. This corresponds to having u = 2, at which point 
the above arguments no longer work. With some additional efforts one can prove the 
following theorem: 



Theorem 5.6 QBo rodin-Okounkov-Olshan ski-OOj . [Jo hansson-Olaj ) . For any two reals x, 
y we have 

lim 9 1/3 K e (26 + a# /3 , 26 + y6 1/3 ) = K Airy (x, y) (5.12) 

9— >oo 



where 



K 



Airy 



l 



(2rri) 




v3/3-iS3/3+vx-iSy dvdw 



v — w ' 



(5.13) 



with contours shown at the right panel of Figure 13 



Remark 1. Theorem 5.6 means that the random point process X(X) "at the edge", after 
shifting by 29 and rescaling by # 1//3 , converges to a certain non-degenerate determinantal 
random process with state space IR and correlation kernel K Airy 



Remark 2. As we will see, Theorem |5.6| implies the following limit theorem for the Last 
Passage Percolation Time Ai: For any s G R 



lim P(Ai < 29 + s9 1/3 ) = det(l - K Airy (x,y)) 



L2(s,+oo)- 



One shows that the above Fredholm determinant is the Tracy-Widom distribution ^(s) 
from Section [TJ see |Tracy-Widom-94| . 



Proof of Theorem 5.6, We start as in the proof of Theorem 5.5 When u = 2 the two 



critical points of S(z) merge, so that the contours now look as in Figure 13 (left panel) 



and the integral in (5.11 ) vanishes. Therefore, the correlation functions near the edge tend 



to 0. This is caused by the fact that points of our process near the edge rarify, distances 
between them become large, and the probability of finding a point in any given location 
tends to 0. 
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Figure 13: Contours for the edge-scaling limit {u = 2). Left panel: f-contour in blue 
and w-contour in green. The dashed contour is the unit circle. Right panel: limiting 
contours. 



In order to see some nontrivial behavior we need rescaling. Set 

v = l + 6~ 1/3 v, w = l + 0- 1/3 w 

in the contour integral. Note that z = 1 is a double critical point of S(z) = z—z~ 1 —2 \n(z) 
so that in the neighborhood of 1 we have 

S(z)= 1 -(z-l) 3 + 0((z-iy) 



Now as 9 — > oo we have 

exp f 6(S(v) - S(w))} = exp ( 6 



-1/3^3 



WY +0(1 



exp ( -v 



~-w 3 



We conclude that as 9 — > oo 



e- 1 ' 3 

(2vri) 2 




^v 3 /3-w 3 /3+vx-wy dvdw 



V — w' 



(5.14) 



and the contours here are contours of Figure 13 (left panel) in the neighborhood of 1; 
they are shown at the right panel of Figure 13 □ 



Using Theorem |5.6| we can now compute the asymptotic behavior of the Last Passage 
Percolation time, i.e. of Ai- Using the inclusion-exclusion principle, for any AeMwe 
have 



i < A) = 1 - ^2pi{x) + y P^ x >y) ~ l\ Yl Ps{x,y,z) + 



(5.15) 



x>A x,y>A x,y,z>A 

Recall that correlation functions pk are k x k determinants involving kernel Kg, substitute 
A = 29 + s6 l l 3 and send 9 — > oo. The sums in (5.15) turn into the integrals and we get 
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(ignoring convergence issues, which can, however, be handled) 



lim P(Ai < 29 + s9 l/3 ) = 1 - / K Airy (x, x)dx 

Jx>s 

K A i ry (x,x) K A i ry (x,y) 

K Airy {y,x) K Airy (y,y) 



+ I det 



2! 



dxdy — . . . . 



' x,y>s 

In the last expression one recognizes the Fredholm determinant expansion (see e.g. |Lax-02j 
or |Simon-0"5] ) for 

det(l - K Airy (x,y)) L2{Si+00 ). 

The conceptual conclusion from all the above is that as soon as we have an integral 
representation for the correlation kernel of a point process, many limiting questions can 
be answered by analyzing these integrals. The method for the analysis that we presented 
is, actually, quite standard and is well-known (at least since the XIX century) under 
the steepest descent method name. In the context of determinantal point processes and 
Plancherel measures it was pioneered by Okounkov and we recommend |Okounkov-03] for 
additional details. 

6 The Schur processes and Markov chains 

While in the previous sections we gave a few of tools for solving the problems of proba- 
bilistic origin, in this section we present a general framework, which produces "analyzable" 
models. 



6.1 The Schur process 

The following definition is due to [Okounkov- R esheti khin-01] . 

Definition 6.1. The Schur process (of rank N) is a probability measure on sequences of 
Young diagrams pS 1 ', \( 2 \ p^ 2 \ . . . , p^ N ~ l \ X^ N \ parameterized by 2N Schur-positive 
specializations pjj~ , . . . , p%_ v pj, ■ ■ ■ ,pjf and given by 

= ^ A (i)(p ( [)s A (i) /M (i)(p 1 ')s A (2) /M (i)(p^) • • • S A (W) / ^-i)(p^_ 1 )s A (jv)(p] v ), (6.1) 

where Z is a normalization constant. 



Proposition 2.8 implies that in Definition 6.1 almost surely 



It is convenient to use the graphical illustration for the Schur process as shown in Figure 
II 

Note that if we set iV = 1 in the definition of the Schur process then we get back the 
Schur measure of Definition 15 . 11 



35 



Figure 14: Graphical illustration for the Schur process with N = 3. 



Let us introduce some further notations. For two Schur-positive specializations p, p' 

we set 

Pk(p)Pk(p') 



H(p; p') = exp £ 



k 



Given two specializations p 1 , p 2 , their union (p 1; p 2 ) is defined through its values on power 
sums pk 

Pk(pUp2) =Pk(pl) +Pk(p2)- 



Theorem 2.11 implies that if p\ is a Schur-positive specialization with param- 
eters (a (1) ,/3 (1) ,7 (1) ), and p 2 is a Schur-positive specialization with parameters 
(a^ 2 \ (3^ 2 \ 7 1 - 2 - 1 ), then (pi,p 2 ) is a Schur-positive specialization with parameters 
(a^ [j a^, f3 { - 1 "> |J 7^ + 7^), where |J stands for the sequence obtained by 
rearranging the union of sequences and in decreasing order (and similarly for 
0). In particular, if p\ and p 2 specialize symmetric functions by substituting sets of vari- 
ables, say (x\, x 2 , . . .) and (yi, y 2 , ■ ■ ■ ) (which corresponds to zero and 7), then (p 1; p 2 ) 
substitutes all the variables (37, x 2 , ■ ■ ■ , yi, y 2 , ■ ■ ■ )■ 

The definition implies that for specializations p±, . . . , p[, . . . , p' m we have 

A; m 

H(px, ■ ■ ■ , p k ; p[, ■ ■ ■ , p'J = YlYl H (P» Pi)- 

1=1 j=l 

Proposition 6.2. Suppose that for every i < j we have H(pf; pj) < 00. Then the Schur 
process is well-defined and the normalization constant Z in its definition is 

Z = Y[H(pp,pj). 

i<j 

Proof. The proof is based on the iterated applications of identities 

s fl /x(p)s II /u{p) = H{p; p) Y s x/K (p')s u/K {p) 

and 

S\/^P,p) = ^2s X /u{p)s v /^{p'). 

The above identities are valid for any specializations p, p' such that all the sums are con- 
vergent and are just the results of the application of these specializations to the statements 



of Propositions 2.7 and 2.9 
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We have: 



s a(d (Po ) s a(d / M d) (pr) s A(2) / M (D (Pi~) • • • S\w /„("-D (Pjv)«aW (Pjv) 
= #(a}"; PD XI s is/»(pT) s vQ)iv(pt) s \w/vPi (Pi) ■ ■ ■ Saw/^-i) (j>n) 8 \w (Pjv) 
= #(Po ; PF) X s 0/^(Pr) s A(2)/ ! ,(p ( [,p|) • ■ ■ s A w/ At ( JV - 1 )(Pjv) s Aw(P^) 

= #(Po ; PF) X s a(2) (Po > Pi") • • • s aW/ m c*-i) (p^)s A (iv) (pjf), (6.2) 

where we used the fact that s / v = unless A^ = 0, and s / = 1. Note, that the 
summation in the last line of (6.2) runs over A^, jjS 2 \ . . . , /z^ -1 ), \^ N \ i.e. there is no 



summation over li^ anymore. Iterating this procedure, we get the value of the 
normalization constant Z. □ 

It turns out that "one-dimensional" marginals of the Schur processes are the Schur 
measures: 

Proposition 6.3. The projection of the Schur process to the Young diagram X^ k > is the 
Schur measure S Pi;P2 with specializations 

Pi = (Po>Pi h ) --->Pfc-i) ) P2 = (Pk'Pk+V-'PN)- 



Proof. The proof is analogous to that of Proposition 6.2 □ 



Proposition CL3 means that the projection of a Schur process to the Young diagram 
can be identified with a determinantal point process and, thus, can be analyzed with 
methods of Section|5} In fact, a more general statement is true: The joint distribution of all 
Young diagrams of a Schur process is also a determinantal point process with correlation 



kernel similar to that of Theorem 5.3 see |Okounkov-Reshetikhin-0T] . |Borodin-Rains-05] . 

Note that if one of the specializations, say is trivial, i.e. this is the Schur positive 
specialization (0;0;0), then (since Sa/ a1 (0;0;0) = unless A = fx) two of the Young 
diagrams should coincide, namely fi^ = A^ fe+1 ^. In this case we can safely forget about 
jjS^ and omit it from our notations. This also shows that in the definition of the Schur 



process we could replace the saw-like diagram of Figure 14 by any staircase-like scenario. 
Let us give two examples of the Schur processes. 

6.2 Example 1. Plane partitions 

A plane partition Y is a 2d array of non-negative integers Y^, i,j = 1, 2, ... , such that 
Ylij Yij < 00 an d the numbers weakly decrease along the rows and columns, i.e. if %' > % 
and f > j, then > Yi'j/. The sum Ylij Yij * s called the volume of the plane partition 
Y. In the same way as ordinary partitions were identified with Young diagrams in Section 
[2| plane partitions can be identified with 3d Young diagrams. To see that, view a plane 
partition as a collection of numbers written in the vertices of the regular square grid on 
the plane and put k unit cubes on each number k. The resulting 3d body is the desired 



3d Young diagram; an example is shown in Figure [15 

Fix q, < q < 1 and consider the following probability measure on the set of all plane 
partitions 

poo = ^<r lume(y) , 
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Figure 15: Left panel: Plane partition of volume 5. Right panel: The corresponding 3d 
Young diagram. 



which is one of the simplest (speaking of definition, not properties) possible probability 
measures on this set. The normalization constant M is given by the celebrated MacMahon 
formula (see |MacMahon-T9i~2] . |Stanley-99| Section 7.20], |Macdonald-95l Chapter I, 



Section 5, Example 13]) 

oo 

M = JJ(1 -q n )~ n . 

71=1 

We claim that the above measure can be described via a Schur process. In fact this is 
a particular case of a more general statement that we now present. 

Definition 6.4. Fix two natural numbers A and B. For a Young diagram n C B A = 
(B, . . . ,B), set ft = B A /n. A skew plane partition II with support ft is a filling of all 

A times 

boxes of ft by nonnegative integers Iljj (we assume that Iljj is located in the ith row and 
jth column of B ) such that ILjj > Rij+i and ELjj > U-i+ij for all values ofi,j. The 
volume of the skew plane partition U is defined as 



volume(n) = II 



1,3 



For an example of a skew plane partition see Figure 16 

Our goal is to explain that the measure on plane partitions with given support 
ft and weights proportional to g volumo ( ) ) < q < 1, is a Schur process. This fact 
has been observed and used in |Ukounkov-Reshetikhin-0 1j . [Ok ounkov-Reshetikhin-05] . 
|Okounkov-Reshetikhin-06] . |Boutillier-Mkrtchyan-Reshetikhin-Tingley-12| , |Borodin-ll] . 

The Schur process will be such that for any two neighboring specializations p^,p\ at 
least one is trivial. This implies that each coincides either with or with A^ +1 -'. 
Thus, we can restrict our attention to A^'s only. 

For a plane partition EE, we define the Young diagrams AW (\<k<A + B + l) via 

AW(n) = {Jhw-A-i | (i,i + k-A-l) eft}. 



Note that A^ 1 -* = \( A+B+1 *> = 0. Figure 16 shows a skew plane partition II and corre- 
sponding sequence of Young diagrams. 

We need one more piece of notation. Define 

£(tt) = {A + iTi - i + 1 \i = l,...,A}. 
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Figure 16: A skew plane partition II with support B A /n. Here B = A = 4 and n = 
(2,1,1). The corresponding sequence of Young diagrams is C (6) D (5) C (8,4) C 
(10,5,3) D (7,3) C (9,6) D (8) D 0. 



This is an A-point subset in {1, 2, . . . , A + B}, and all such subsets are in bijection with 
the partitions n contained in the box B A ; this is similar to the identification of Young 
diagrams and point configurations used in Theorem 5.3 The elements of C(tc) mark the 
up-right steps in the boundary of 7r (=back wall of II), as in Figure 16 



Theorem 6.5. Let n be a partition contained in the box B A . The measure on the plane 
partitions H with support ft and weights proportional to the Schur process with 

N = A + B + 1 and Schur-positive specializations {p^}, {pj} defined by 



Po = Pn 
(<r 5 ';0;0), je£(7r), 
(0;0;0), J(£C(n); 



(0;0;0) 



(0;0;0), jeC(n) 
(<7*;0;0), j£C(n) 



(6.3) 
(6.4) 



where (a; 0; 0) is the Schur-positive specialization with single non-zero parameter ot\ 



a. 



Remark. One can send A, B to infinity in Theorem (6.5). If n = 0, then we get plane 
partitions (3d Young diagrams) we started with. 

Proof of Theorem 6.5, Definition |6.4 and Proposition 2.12 imply that the set of all skew 
plane partitions supported by W, as well as the support of the Schur process from the 
statement of the theorem, consists of sequences (A«,A( 2 ),...,AW) with 

aw = A^ = 0, 
A (i) ^ A (i+i) if j e C (x), x U) y X u+1) if j i C(X), 

where we write p -< v or v >~ p if V\ > p\ > > P2 > 



On the other hand, Proposition 2.12 implies that the weight of (A^, A^, . . . , A^) 



with respect to the Schur process from the hypothesis is equal to q raised to the power 

A+B 

I A(J) | \ (i ~ tfh-iecfr) - U - +J 1 jec(n) +jl j( fc(n) 

i=2 
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where the four terms are the contributions of PjLx, Pj-i, Pj > Pj , respectively. 

Clearly, the sum is equal to Y^=? I | = volume(II) . □ 



Theorem 6.5 gives a way for analyzing random (skew) plane partitions via the 
approach of Section [5j Using this machinery one can prove various interest- 
ing limit theorems describing the asymptotic behavior of the model as q — > 1, 
see |Okounkov-Reshetikhin-0T] . |Okounkov-Reshetikhin-05] . |Okounkov-Reshetikhin-06] . 
Boutillier-Mkrt chyan- Reshetikhin-Tingley- 1 2 1 . 



6.3 Example 2. RSK and random words 

Our next example is based on the Robinson-Schensted-Knuth correspondence and is a 
generalization of constructions of Section |4} 

Take the alphabet of N letters {1,...,N} and a collection of positive parameters 
ai,...,ajv- Consider a growth process of the random word Wjv(t) with each letter j 
appearing (independently) at the end of the word according to a Poisson process of rate 
dj. In particular, the length |a>jv(£)| of the word at time t is a Poisson random variable 
with parameter (g^ + ■ ■ ■ + ajy)t: 

P(K(t)| — k) — e-^^ ((ai + ,, M + ^ f 

k\ 

The growth of Wjv(0 can be illustrated by the random point process with point (t,j) 
appearing if the letter j is added to the word at time t, see Figure [TTj For any T > 0, 
one can produce a Young diagram \( N \T) from all the points (t, j) with t < T using 
the Robinson-Schensted-Knuth (RSK) algorithm, whose geometric version was given in 
Section El (We again address the reader to |Sagan-01 , [Ful ton-97j . |Knuth-73] for the 



details on RSK.) In particular, the length of the first row of X^ N \T) equals the maximal 
number of points one can collect along a monotonous path joining (0,0) and (T,N), as 



in Figure 17. More generally, let WN_k(t) be the word obtained from w^ft) by removing 
all the instances of letters N, N — 1, . . . , N — k + 1, and let \( N ~ k \T) denote the Young 
diagram corresponding to W]y^k(t), i.e. this is the Young diagram obtained from all the 
points (t, j) with t < T, j < N — k. 

Proposition 6.6. For any t and N the collection of (random) Young diagrams 
A (1) (t), A (2) (t), . . . , \ {N \t) forms a Schur process with probability distribution 

— s A (i)(ai)s A (2)/ A (i)(a 2 ) • • • S\(N)/\(*r-i)(a>N)s\(N)(pt), 

where we identify ai with the Schur-positive specialization with parameter oti = and all 
other parameters 0, and p t is the specialization with single non-zero parameter j — t. 

Proof. This statement follows from properties of RSK correspondence, cf. |Johansson-05] . 

□ 

Now let us concentrate on the random vector (£i(t), . . . , ^Ar(t)) = (A^ (t), . . . , A^ (t)) 



and try to describe its time evolution as t grows. Recall that in Figure 17 the value of 
£k{T) is the maximal number of points collected along monotonous paths joining (0,0) 
and (T, k). Suppose that at time t a new letter k appears, so that there is a point (t, k) 
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Figure 17: Collection of points with coordinates (t,j) corresponding to the growth of 
random word and the path collecting maximal number of points £jy(T) = A^(T) = 5. 
Here N = 5. 



in the picture. It means that £k grows by 1 (£k{t) = £k{t—) + 1), because we can add 
this point to any path coming to any (t f , k) with t' < k. Clearly, £j does not change for 
j < k. Note that if £k+i(t—) > £^(t—), then £^ + x also does not change, since the maximal 
number of points collected along a path passing through (t, k) is at most £k{t). Finally, 
if £k(t—) = £k+i(t—) = ■ ■ ■ = £k+m(t~), then all the numbers £^, . . . , £k+m should increase 
by 1, since the optimal path will now go through (t, k) and collect £k points. 

The above discussion shows that the evolution of (£x(t), . . . ,£^(t)) is a Markov pro- 
cess, and it admits the following interpretation. Take iV (distinct) particles on Z with 
coordinates £\ + 1 < £% + 2 < ••• < £^ + N. Each particle has an independent exponential 
clock of rate <Zj. When a clock rings, the corresponding particle attempts to jump to the 
right by one. If that spot is empty then the particle jumps and nothing else happens. 
Otherwise, in addition to the jump, the particle pushes all immediately right adjacent 



particles by one, see Figure 18 for an illustration of these rules. The dynamics we have 
just described is known as the Long Range Totally Asymmetric Exclusion Process, and 
it is also a special case of PushASEP, see [B orodin- Ferrari-08a] . |Spitzer-70 . 



o • • o o • o ► •o«o#o»o 



•o««oo«o ► •oo««o«o 

Figure 18: PushASEP. Top panel: jump of a particle. Bottom panel: jump of a particle 
which results in pushing. 

We also note (without proof) that if instead of (A^ (t), . . . , \[ N \t)) one considers the 
random vector (Xi(t), \% 2 (t) . . . , X^\t)), then the evolution of the particles — N < 
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(N-l) 
N-l 



(N - 1) < • • • < A? } - 1 is the well-known TASEP process that was presented 



A 

in Section Q] 

The conclusion now is that Proposition Gi) together with methods of Section [5] gives 
a way of the asymptotic analysis of TASEP and PushASEP stochastic dynamics at large 
times. In particular, one can prove the analogues of Theorems 
[Jo hansson-00] . |Borodin-Ferrari-08a] . 



1.3, 1.4, 1.5 



see e.g. 



6.4 Markov chains on Schur processes 

In the last section we linked Schur processes to simple particle dynamics like TASEP 
using the RSK correspondence. In this section we produce another family of Markov 
dynamics connecting such objects whose description is arguably more straightforward 
and independent of complicated combinatorial algorithms, such as RSK. 

We start by introducing a general framework. Let p and p' be two Schur-positive 
specializations such that H(p; p') < oo. Define matrices p\^^ and p\_+„ with rows and 
columns indexed by Young diagrams A and p as follows: 



t 



(p;p) 



l 



s m(p) 



H(p;p')s x (p) 



Wa(p') 



4- 

Px- 



t (p;p') 



'A/ M (p')- 



(6.5) 
(6.6) 



Proposition 6.7. The matrices p\_+„ and Px_+ U are stochastic, i.e. all matrix elements 



are non-negative, and for every A G Y we have 



5>l 



5>i 

A*eY 



Xpip) 



1. 



(6.7) 



(6.8) 



Proof. Non-negativity of matrix elements follows from the definition of Schur-positive 



Proposition 2.7, and equality 



specializations. Equality (|6.7|) is a specialized version of the skew Cauchy identity of 

□ 



6.8|) is a specialized version of Proposition 2.9 
and p\ 



Proposition 6.7 means that matrices p\ 



can be viewed as transitional 
probabilities of Markov chains. Definitions (6.5), (6.6) and Proposition 2.8 imply that 



t 
Pa- 



= unless p D A, i.e. the Young diagram increases, and p^_>.„ = unless p C A, i.e. 



the Young diagram decreases (hence up and down arrows in the notations). 

One of the important properties of the above stochastic matrices is that they agree 
with Schur measures, i.e. Markov chains defined using them preserve the class of Schur 
measures. Formally, we have the following statement: 



Proposition 6.8. For any p 6 Y we have 

$^§pi;p 2 0)pL m (P2;P3) 



(P) 



AeY 



and 



AeY 



P2,P3 ( a )pa^(p2;ps) 



"pr,p2 



(aO- 
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Remark. Informally, increases the specialization by adding p 3 , while decreases 
the specialization by removing p 3 . Note also that S Pi;P2 = §p 2 ;pi f° r an y Pii Pi- 



Proof of Proposition 6.8 This is an application of the specialized versions of Propositions 
OandU □ 

Next, note that the distribution of the Schur process of the form which appeared in 



Proposition 6.6 



can be rewritten as 



Po>->Pn-1'P 



A(iV)^ A (iv- 



-i) (Po > ■ ■ ■ > PjV-2) Pn-i) ' ' 'P\(2)^.\W (Po ) Pi")- 



More generally, any Schur process can be viewed as a trajectory of a Markov chain with 
transitional probabilities given by matrices p^ and (with suitable specializations) and 
initial distribution being a Schur measure. 

Another property that we need is the following commutativity. 

Proposition 6.9. The following commutation relation on matrices p A _> M and Px->n holds: 

P t (Pi, Pi, P3)p l (pi, Pi) = P l (pi,p2)p t (pi; Ps) (6.9) 



Remark. In terms of acting on Schur measures, as in Proposition 6.8, (6.9) says that 
adding p 3 and then removing p2 is the same as first removing p2 and then adding p 3 : 

^P4iPl,P2P t (Pl? Pi] Pz) = §p 3 ,P4;pi,P2! ^P3,P4;P!,P2P J '(Pi; Pi) = ^p 3 ,P4ipl) 

S P4;pi,p 2 P ; (Pi; Pi) = § P4;pi, S P4;piP t (Pl! Ps) = $ P3 ,P4;pi- 



Proof of Proposition 6.9 We should prove that for any A, v E Y 

^2p[^(pi,Pi;P3)pi^ u ( y pi;pi) = ^2p\^(pii pi)p1^v(pii Ps)- 



Using definitions (6.5), (6.6) this boils down to the specialized version of the skew Cauchy 
Identity, which is Proposition 2.7 cf. |Borodin-ll] . □ 



Commutativity relation (6.9) paves the way to introducing a family of new 
Markov chains through a construction that we now present. This construction 
first appeared in [Di aconis-Fill-90] and was heavily used recently for probabilis- 
tic models related to Young diagrams, see [Bo rodin- Ferrari-08bj . |Borodin-Gorin-08] . 
|Borodin-Gorin-Rains-10| . |Borodin-Duits-ll j . [Bo rodin-11] . |Borodin-01shanski-12j . 
[Betea-llJ, [Bo rodin-Corwin-11] . 

Take two Schur-positive specializations pi, p 2 and a state space Y< 2 > of pairs of Young 
diagrams (^(i)) such that pf( 2 ) wi) (pi; Pi) > 0. Define a Markov chain on Y^ 2 ) with the 
following transition probabilities: 



P 



A (2) 
Ad) 



— >■ 



Pxm^M'P 



, i (pi-P2;p / )pi( 2 )^( 1) (pi;p2) 



E m p1( 2 )^ u (pi, p 2 ; pOp^d) (pi; P2) 
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In words (6.10) means that the first Young diagram \^ — > p^ evolves according to 
the transition probabilities P^^^ipi', p'), and given A^ 2 ** and p^ the distribution of 
p^ is the distribution of the middle point in the two-step Markov chain with transitions 

p!w-»a.(pi> p"> and (Pi5 to)- 

Proposition 6.10. The above transitional probabilities on map the Schur process 
with distribution 

S pl ,p 2; p-(A (2) )^ (2) ^ A{1) (pi;p 2 ) 
to the Schur process with distribution 

Sp 1 ,p 2 ;p-,p'(A (2) )^ (2) ^ A(1) (pi;p 2 )- 
Informally, the specialization p' was added to p~ and nothing else changed. 



Proof. Direct computation based on Proposition 6.9, cf. |Borodin-Ferrari-0 8b , Section 
2.2]. ' □ 

More generally, we can iterate the above constructions and produce a Markov chain 
on sequences of Young diagrams \( N \ . . . , as follows. The first Young diagram A*- 1 ) 
evolves according to transition probabilities P^i)^ (i)G°l ;/>')• Then, for any k > 2, as 
soon as p^ k ~ 1 ^ is defined and given A^ the distribution of p^ is the distribution of the 
middle point in the two-step Markov chain with transitions p^k)_+ G°i> • • • > Pk) p') an d 

p^(k-i)(pi,---,Pk-i;Pk)- 

Similarly to Proposition 6.10 one proves that one step of thus constructed Markov 
chain adds p' to the specialization p~ of the Schur process with distribution 

®Pi,..., PN ; P -(* {N) )pi(N)^ X (N-i)(Ph ■ ■ .,Pn-i]Pn) ■ ■ ■ P[(,2)^ xm (pi] P2)- (6.11) 

The above constructions might look quite messy, so let us consider several examples, 
where they lead to relatively simple Markov chains. 

Take each p^ to be the Schur-positive specialization with single non-zero parameter 
CKi = 1, and let p' be the Schur-positive specialization with single non-zero parame- 
ter Pi = b. Consider a discrete time homogeneous Markov chain (A^'(t), . . . , X^ N '(t)) 
with defined above transitional probabilities and started from the Schur process as in 
(6.11) with p~ being trivial specialization (with all zero parameters). This implies that 
(A w (0), . . . , A (Ar) (0)) = (0, . . . , 0). Note that at any time t the Young diagram A (fc) (t) 
has at most k non-empty rows and their coordinates satisfy the following interlacing 
conditions: 

x (k) > A (*-i) > A W > . . . > A (*_-i) > A « ( 6 .i2) 

In particular, A^ has a single row, i.e. it is a number. The definitions imply that the 
transitional probabilities for A^ are 

r^, ifpW = A« + i, 

pU^(pi;p') = \m> if = aw, 

^0, otherwise. 

In other words, the evolution of A^ is a simple Bernoulli random walk with probability 
of move p = 6/(1 + 6). More generally, given that \W(t) = A and A^ 1 ^ + 1) = p the 
distribution of A^(t + 1) is given by 
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Prob(A (fc) (t + l)=v\ \ {k \t) = A, A (fc - 1} (t + 1) = (i) 



Va(0;&;0)^(1;Q;Q) 

J2 ri s v/x (0;b;0)s ri/tl (l;0;0) 



b w\-w 



given that < V{ — Aj < 1, and \i and v satisfy the interlacing conditions (6.12). In 
other words, the length of each row of A'*' independently increases by 1 with probability 
6/(1 + 6) unless this contradicts interlacing conditions (in which case that length either 
stays the same or increases by 1 with probability 1). 

In order to visualize the above transitional probabilities consider N(N + 1)/2 interlac- 
ing particles with integer coordinates x{ = A^ 1 _ j — N + i, j — 1, . . . , iV, i — 1, . . . , j. In 

other words, for each j we reorder the coordinates A^ and make them strictly increasing. 
The coordinates of particles thus satisfy the inequalities 



4-1 < x i-l < 4- 



(6.13) 



Such arrays are often called Gelfand-Tsetlin patterns (or schemes) of size N and under 
this name they are widely used in representation-theoretic context. We typically use the 
notation x\ both for the location of a particle and the particle itself. It is convenient to 
put particles x\ on adjacent horizontal lines, as shown in Figure 
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Figure 19: Interlacing particles. 



Now the dynamics has the following description. At each time t each of the iV(iV + 
l)/2 particles flips a biased coin with probability of "heads" p — 6/(1 + b) (all flips are 
independent). After that positions x\{t + 1) of particles at time t + 1 are obtained by 
the following sequential update. First, we define x\(t + 1), then xf(t + 1), i — 1,2, then 
x\{t + 1), i — 1,2,3, etc. To start with, we set x\(t + 1) = x\(t) + 1, if the coin of 
the particle x\ came "heads", otherwise, x\(t + 1) = x\(t). Subsequently, once the values 
of x\(t + 1), j = 1, . . . , k — 1, % = 1, . . . ,j are determined, we define x*{t + 1) for each 
i = 1, . . . , k independently by the following procedure, in which each rule 3 is performed 
only if the conditions for the previous two are not satisfied. 

1. If i > 1 and xf(t) = x^(t + 1) - 1, then we say that particle x\ is pushed by x\_l 
and set x\(t + 1) = x*{t) + 1. 

2. If ^(i + 1) = xj(t) + 1, then we say that particle x\ is blocked by x\ 1 and set 
x k l {t + l) = x k i {t). 
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3. If the coin of the particle x\ came "heads", then we set x\(t + 1) = x\(t) + 1, 
otherwise, we set x\{t + 1) = x^(t). 

Informally, one can think of each particle having a weight depending on its vertical 
coordinate, with higher particles being lighter. The dynamics is defined in such a way 
that heavier particles push lighter ones and lighter ones are blocked by heavier ones in 
order to preserve the interlacement conditions. 

One interesting fact about this dynamics is that the joint distribution of x\{t + j) is 
the projection of the uniform distribution on the domino tilings of the so-called Aztec 
diamond, see |Nordenstam-10| . [Borodin- Ferrari-~0 8bj . A computer simulation showing 



the connection with tilings can be found on Ferrari's website [F errari] . 

Sending £» — and rescaling the time we get a continuous version of the above dy- 
namics. Formally, the process Y(t) = {Y/(t)}, t > 0, j = 1,...,N, i = is a 
continuous-time Markov chain defined as follows. Each of the N(N + l)/2 particles has 
an independent exponential clock of rate 1 (in other words, the times when clocks of 
particles ring are independent standard Poisson processes). If the clock of the particle Yf 
rings at time t, we check whether Y/ -1 (t— ) = Y/(t—) + 1. If so, then nothing happens; 
otherwise we let Y?(t) = Yj (t-) + 1. If Y?(t-) = Y^(t-) = ■■■ = Y^ k k (t-), then we 
also set Y^{t) = Y^(t-) + 1, . . . , Y>£(t) = Y>£(t-) + l. 

The Markov chain Y was introduced in [Borodin- Ferrari-08b| as an example of a 2d 
growth model relating classical interacting particle systems and random surfaces arising 
from dimers. The computer simulation of Y(t) can be found at Ferrari's website [F errari] . 
The restriction of Y to the N leftmost particles Y±, . . . , Y± is the familiar totally asym- 
metric simple exclusion process (TASEP), the restriction Y to the N rightmost particles 
Y±, . . . ,Yff is long range TASEP (or PushASEP), while the particle configuration Y(t) 
at a fixed time t can be identified with a lozenge tiling of a sector in the plane and with 
a stepped surface (see the introduction in [Borodi n- Ferrari-08b] for the details). 



One proves that the fixed time distribution of Y(t) is the Schur process of Section 6.3 
with Oj = 1 (under the above identification of particle configurations and sequences of 
Young diagrams). Moreover, the restriction of Y(t) on N leftmost particles Yj 1 , Y± . . . , Y^ 



is the same as the restriction of the dynamics of Section 6.3 and similar statement is true 



for restrictions on Y±, Y 2 2 . . . , Y$ and on Y 1 , Y^, . . . , Yff. Nevertheless, the dynamics 



Y(t) is different from that of Section 6.3 



By appropriate limit transition we can also make the state space of our dynamics 
continuous. 

Theorem 6.11 ( |Gorin-Shkolnikov- 12] ) . The law of N(N + l)/2 dimensional stochastic 
process (Y(Lt) — tV)j\fL, t > converges in the limit L — > oo to the law of a continuous 
time-space process W(t). 



Note that for N = 1 Theorem 6.11 is a classical statement (known as Donsker invari- 
ance principle) on the convergence of Poisson process towards the Brownian motion (see 
e.g. |Billingsley-95[ Section 37] or |Kallenberg-02 Section 12]). 



The process W(t) was introduced in |Warren-T)7] and has an independent probabilistic 
description: W\ is the standard Brownian motion; given W- with j < k the coordinate 
Wf is defined as the Brownian motion reflected by the trajectories W^Si (if i > 1) and 
W^~ l (if i < k) (see |Warren-"07] for a more detailed definition). The process W has 
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many interesting properties: For instance, its projection W i 



N 



h 



N, is the N- 



dimensional Dyson's Brownian motion, namely the process of N independent Brownian 
motions conditioned to never collide with each other. The fixed time distribution of 
vector Wf, i = 1,...,N, can be identified with the distribution of eigenvalues of the 
random Hermitian matrix from the Gaussian Unitary Ensemble (it was introduced after 
Definition 3.8 above), and the fixed-time distribution of the whole process W(t) is the 



so-called GUE-minors distribution, cf. |Baryshnikov-01|, [ Jo hansson-Nordenstam-06] . 



Our construction of Markov dynamics can be adapted to the measures on (skew) 

In particular, this leads to an efficient perfect sampling 



plane partitions of Section 6.2 



algorithm, which allows to visualize how a typical random (skew) plane partition looks like, 



see |Borodin-llj for details and Figure 20 for a result of a computer simulation. Further 




generalizations include, in particular, various probability distributions on boxed plane 
partitions (see |Borodin-Gorin-08] . |Borodin-Gorin-Rains-10] . |Betea-li] ]) and directed 
polymers in random media which we discuss in the next section. 

One feature that all of the above Markov chains share is that the transitional prob- 
abilities decompose into relatively simple Id distributions. In other words, each step of 
the dynamics boils down to sampling from several explicit distributions and applying not 
very complicated combinatorial rules. But in the same time, after we do several steps, we 
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arrive at highly non-trivial probabilistic objects of independent interest, such as random 
plane partitions or eigenvalues of random matrices. 



7 Macdonald polynomials and directed polymers 

In this section we generalize the constructions of Sections^ [6]based on the Schur functions 
to their (q, t) deformation known as Macdonald symmetric functions and discuss new 
applications. 



7.1 Preliminaries on Macdonald polynomials 

In what follows we assume that q and t are real numbers satisfying < g < 1, < i < 1. 

One way to define Macdonald symmetric functions P\(- ;q,t) indexed by Young dia- 
grams A is through the Gram-Schmidt orthogonalization procedure. Define the following 
scalar product on A via its values on the basis of power sums as 



£(\) 



l-q 



A,: 



i=l 



where z\, p\ are as in Theorem 2.5 



Definition 7.1. Macdonald symmetric functions P\(- ;q,t) form a unique linear basis in 
A such that 



1. The leading coefficient of P\( ■ ;q,t) (with respect to lexicographic ordering on mono- 
mials) is x^x^' 2 

2. (P x (- ;q,t),P tl (- ;q,t)) q t = 0, unless X = \i. 

We omit the dependence on (q, t) and write simply P\(-) when it leads to no confusion. 

Since A is a projective limit of algebras of symmetric polynomials in A" variables 
Xi, . . . ,Xn, the functions P\ automatically define Macdonald symmetric polynomials in 
A" variables P\(x\, . . . , x^; q, t). The latter can be characterized (for generic q, t) by being 
eigenvectors of certain difference operators. 

Definition 7.2. The rth Macdonald difference operator is defined as 

IC{l,...,N},\I\=r iel 

A I (x-,t)=fW 2 TT t J^L, 

-L -L rp . rp . 



where 



and Ti is the q -shift operator in variable X{ 

(T i f)(x 1 , ...,x N ) = f(x lt . . . , qxi, x i+1 , ...,x N ). 

In particular, 

(Pif)(xx, ■ ■ ■ ,Xn) = X% _ Xj f( Xi > ■ ■ ■,x i -i,qx i ,x i+1 , . . .,x N ). 

i=l jjti J 
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Proposition 7.3. For any N > 1, polynomials P x (xi, . . . , x^; q,t) with £(X) < N form 
a common eigenbasis of operators T>^ , r = 1, . . . , N. More precisely, 

V?P x (x u ...,x N ;q,t) = e r (q X n N ~ l , q x H N ~ 2 , q XN )P x (x 1 , . . . ,x N ;q,t), 

where e r are the elementary symmetric polynomials. In particular, 

T>?P x (x u ...,x N ;q,t) = (q x H N - x + q x H N ~ 2 + ■■■ + q XN )P x (x u . . . , x N ; q, t). 



Proposition 7.3 can be taken as an alternative definition of Macdonald polynomials. 
Both these difference operators and polynomials themselves were introduced by I. Macdon- 
ald in the late 80s |Macdonald-88] . Macdonald polynomials generalized various previously 
known classes of symmetric polynomials. When q = t we get Schur polynomials, when 
q = we get Hall-Littlewood polynomials, and if we set q = t a and send t to 1 then we get 
Jack polynomials. We follow the notations of Macdonald's book |Macdonald-95| where 
an interested reader can find proofs of various properties of Macdonald polynomials that 
we use. 

Macdonald polynomials inherit many of their properties from Schur polynomials. In 
particular, there exist Macdonald versions of skew functions and Cauchy identity which 
we used for the definition of the Schur measure (process) and for the construction of the 
dynamics in Section [6j 

Set 

n ( a- p a(- 

WAl '' g ' ] (P X (-;q,t),P x (.;q,t)) q ; 

The normalizing constant (P\(- ; q,t), P x (- ; q,t)) qt can be computed explicitly, see 
|Macdonald-95l Chapter VI]. 

The following statement is a (q, t)-analogue of the Cauchy identity: 

Proposition 7.4. Let x and y be two sets of variables. Then 

E p ^ *)Qa(w 9,t) = n f^ ;g K 

where 

oo 

(a; q)oo = -a<f). 

i=0 



Similarly to Definition 2.6 we have 

Definition 7.5. Let x and y be two sets of variables. The skew functions Px/fj, and Q x /^ 
are defined via 

Px(x,y) = Y,P\/v(x)PM> 

and 

Qx(x,y) = ^Qx/MQ^y). 

AteY 
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There is also a (q, t) analogue of the skew Cauchy identity, see |Macdonald-95[ Chapter 
VI, Section 7]. 

Another ingredient of our constructions related to Schur functions was the classification 
of positive specializations of Theorem 2.11 For Macdonald polynomials the following 
conjecture is due to Kerov: 

Conjecture 7.6. The Macdonald-positive specialization^ are parameterized by pairs of 
sequences of non-negative reals a = (cki > cti > ■ ■ ■ > 0) and f3 = > /?2 > • • • > 0) 
satisfying + A) < 00 an d an additional parameter 7 > 0. The specialization with 

parameters (a; f3; 7) can be described by its values on power sums 



Px ^ px(a; /3; 7) = + ^ \ ai + T^f ^ 

i ^ 

Pk ^ Pk(a; ^ 7) = E (a* + ("l)^ 1 ^^^) , * > 2, 
or, equivalently, via generating functions 

g k (a; /?; 7)^ = e" J^ 1 + fe)rt. 
fc=o i>i 

where gu is the one-row Q -function 

9k(-) = Q(fc)( •;?>*)• 

Remark 1. The fact that all of the above specializations are non- negative on Macdonald 
symmetric functions P\ is relatively simple, see e.g. |Boro din- Corwin- 1 1 \ Section 2.2.1]. 
However, the completeness of the list given in Conjecture |7.6| is a hard open problem 
known as Kerov's conjecture and stated in |Kerov-03l Section II. 9]. 

Remark 2. One can show that all Macdonald-positive specializations given in Conjecture 



7.6 also take non-negative values on skew Macdonald symmetric functions. 

Continuing the analogy with Schur functions we define Macdonald measures. 

Definition 7.7. Given two Macdonald-nonnegative specializations p\, p 2 , the Macdonald 
measure M Pi;P2 is a probability measure on the set of all Young diagrams defined through 

p A\ _ P\(pl)Q\{p2) 

where the normalizing constant H qt (pi\ p 2 ) is given by 

-t k Pk{pi)Pk{p2 



H q , t (pi,p2) = exp 




k 



2 A specialization p of A is called Macdonald-positive if P\{p) > for any A e Y (we assume 
<q,t < 1). 
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Remark 1. The above definition makes sense only if p 1; p 2 are such that 

yPx( Pl )Qx(p 2 )<oo, (7.1) 



A 



in which case this sum equals H qt (pi; p 2 ), as follows from Proposition 7.4 The conver- 
gence of (7.1) is guaranteed, for instance, if \pk{pi)\ < Cr k and |pfc(p 2 )| < Cr k with some 



constants C > and < r < 1. 

Remark 2. The definition of Macdonald measure was first given almost ten years ago 
by Forrester and Rains, see [Forrester- Rains-05] . In addition to the Schur (q = t) case, 
Macdonald measures (and processes) were also studied by Vuletic |Vuletic-09] for the 
Hall-Littlewood symmetric functions, which correspond to q = 0. Recently, new applica- 
tions of these measures and new tools to work with them were developed starting from 
|Borodin-CorwimTT . 



Definition 6.1 of the Schur process also has a straightforward (g, t)-analogue involving 



Macdonald polynomials. In our applications we will use only one particular case of this 



definitions which is a (g, t) generalization of measures of Section 6.3 



Definition 7.8. Given Macdonald-nonnegative specializations pf, . . . , p^ and p~ , the as- 
cending Macdonald process is the probability distribution on sequences of Young diagrams 
A^, . . . , A^ defined via 

P(A (1) \ {N) ) - Px(1) ( P ^ Px(2) / XW ( P ^ ' ' ' jP A( jv )/A( jv - 1 )(Pjv)QA(Af)(P~) 

H q ,t{Px] P -)---H q>t (p^;p-) 
7.2 Probabilistic models related to Macdonald polynomials 



The construction of Section 6.4 of dynamics preserving the class of Schur processes can 
be literally repeated for Macdonald processes by replacing all instances of skew Schur 
polynomials by skew Macdonald polynomials, see |Borodin-Corwin- 1 1 1 Section 2.3] for 
the details. 

Let us focus on one example. We set t = till the end of this section and, thus, Mac- 
donald functions are replaced with their degeneration known as g-Whittaker functions. 



We continue the analogy with Section 6.4 and study the example which led to the 



process Y{r) (we replaced the time parameter by r, since t now has a different meaning). 
Namely, we set p\ to be the specialization with single non-zero parameter <x\ — 1 in 



Definition |7.8| and p~ to be the specialization with single non-zero f3\ = b. At each step 
of our (discrete time) dynamics another /3 equal to b is added to the specialization p~. 
Sending b — > and rescaling time we arrive at the continuous time dynamics Z(t) which 
should be viewed as a g-analogue of Y(t). Let us give a probabilistic description of Z(t). 

The process Z(t) = {Zl (r)}, t > 0, j — 1, . . . , N, % = 1, . . . ,j, is a continuous time 
Markov evolution of iV(iV + l)/2 particles with its coordinates being integers satisfying 
the interlacing conditions 

ZL(r) < ZClir) < Zf(r) 

for all meaningful i, j, and defined as follows. Each of the iV(iV + l)/2 particles has an 
independent exponential clock. The clock rate of particle Z\ at time r is 

{l-q^-^-^-q^-zU^ 

1 _ Q ZHr)-ZU(,r) + l ' 1 • ' 
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in other words, this rate depends on the positions of three neighbors, as shown in Figure 



21 If one of the neighbors does not exist, then the corresponding factor disappears 
from (7.2). When the clock of particle Z\ rings at time r, we let Z\{t) = Z\{t—) + 1. 



If Z (t- 



+ 1 , 



z j+k 



T- 

7'j+k 



then we also set 



Z] 



+ 



(r) = Z£7£(t-) + 1. Equivalent^, one can think that when (|7.2l) becomes 



■-)••■) ^i+k V ' ! ^i+k 

infinite because of the denominator vanishing, the corresponding particle immediately 
moves by one to the right. Note that if Z\~ = Z{ + 1, then the rate (7.2) vanishes, 



therefore the blocking which was present in the definition of Y{f) is also implied by the 
definition of Z(t). 




o 



1 a 

» p q 

a=2 



O 



Figure 21: Left panel: The distances to three neighbors which the rate of the particle 
depends on. Right panel: jump rate for g-TASEP. 



The restriction of Z{r) to the leftmost particles Zl(r), . . . , Z^{j) is a Markovian 
dynamics known as g-TASEP (see |Borodin-Corwin- 1 1| ) . Namely, the rate of particle Z\ 

When q — > we recover the 
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at time r is 1 — q Zl ^ 1 , as shown in Figure 

familiar TASEP. 

There is one distinction from Y(t), namely, the restriction of Z{j) on the rightmost 
particles is no longer a simple Markov process. 

There are various interesting limit transitions as r — > oo and q — > 1. Let us concentrate 
on one where a phenomenon known as crystallization appears. Let 

g = exp(-e), r = \, 

and send e to 0. In this limit, particles Z\(t) approximate a perfect lattice, namely, 

/ \ t In e , 

zi(T)*-2-— (2*-j-i). 

The fluctuations around the points of this lattice are of order More precisely, the 
following theorem holds. 

Theorem 7.9. Let 

Z>{t,e)=e(zi(T)-± + —{2i-j-l)), g = exp(-e), r * 



e 2 e I e 2 



Then as e — > 0, the stochastic process Z(r,e) weakly converges to a certain random vector 
T(t). 



The proof of Theorem 7.9 can be found in |Borodin-Corwin-ll] . The fixed time dis- 
tribution of T/(t) is known as the Whittaker process (first introduced in |Q , Connell-12j ) 
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and it is related to yet another class of symmetric functions (which are no longer polyno- 
mials) that can be obtained from Macdonald symmetric functions through a limit tran- 
sition. These are class one jj^-Whittaker functions, see e.g. |Kostant-77j . |Etingof^99 , 
[Gerasimov-Lebedev-Oblezin-11] . 

The stochastic dynamics T- (t) can be also independently defined via certain relatively 
simple stochastic differential equations, see |Q , Connell-12j . 

The interest in T/(t) is caused by its remarkable connections with directed polymers 
found by O'Connell and Yor jO^onnell-YTFOl] . |()'Connell-12] . 

Theorem 7.10. Consider N independent Brownian motions Bi(s) ; . . . ,Bn{s). The dis- 
tribution of exp(T 1 Ar (t)) coincides with that of the N — 1 -dimensional integral 



I exp ( 



.61(0, si) + B 2 (s 1 , s 2 ) H h B N (s N ^i, t) ) dsi ■ ■ ■ ds N -i, 



(7.3) 



0<si<S2<---<sjv-i<t 



where 



Bi(a,b) = Bi(b) - Bi(a) 



The integral (7.3 ) is known as the partition function of the O'Connell- Yor semidiscrete 
polymer. This has the following explanation. Identify each sequence < Si < s 2 < ■ ■ ■ < 
Sjv-i < t with a piecewise-constant monotonous function with unit increments at or, 
equivalently, with staircase-like path ("directed polymer") joining (0,1) and (t,N), as 
Further, we view Bi(a,b) as an integral of the Id white noise along 
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shown in Figure 
the interval (a, b). Then the sum 

£i(0, si) + B 2 {s x , 8 2 ) + ■ ■ ■ + B N (s N -i,t) 

turns into the integral of space-time white noise along the path defined by < s% < s 2 < 
■ ■ ■ < Sjv_i < t. Integrating over all possible paths we arrive at the partition function, see 
[Borodin- Corwin- 11] . |Borodin-Corwin-Ferrari-T2] and references therein for more details. 



5 
4 



1? 



3 
2 



Figure 22: Staircase-like path corresponding to the sequence < S\ < s 2 < ■ ■ ■ < s^-i < 
t. Here iV = 5. 
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7.3 Analysis of Macdonald measures 

In the previous section we explained that the formalism of Macdonald processes leads to 
quite interesting probability models, but we do not know yet how to analyze them. The 
methods we used for the Schur processes were based on the determinantal point processes. 
Similar determinantal structure is not known to exist for the Macdonald processes and, 
probably, there is no such structure at all. However, there is another approach based on 
the Macdonald difference operators. 

We start from the (q, t)-Cauchy identity of Proposition 7.4 in the form 

N 

,x N ',q,t)Q x (Y;q,t) = Y[H(x i ;Y), 



AeY 



(7.4) 



where 



U(x 



(txyj;g)c 



exp 




t k p k {Y)x k 
q k k 



Let us apply the Macdonald difference operator T> r N to both sides of (7.4) with respect to 
the variables x%, . . . , x^. Proposition |7.3| yields 



e r (q x H N -\ . . . , g A -)P A (x 1? ...,x N ;q, t)Q x (Y; q, t) = V' N ( J] Tl( Xl ; YU . 
AeY \i=i / 

We immediately arrive at the following statement. 

Proposition 7.11. Let A G Y be distributed according to the Macdonald measure M pi P2 , 
where p\ is the specialization with finitely many a-parameters a\, . . . , ajy- Then for r = 
1,...,N 

(ntinfa', /»)] 



E e r (q Xl t 



Xij.N-1 



nii n (^;P2) 

- t k Pk(p)x k 



where 



U(x; p) = exp 




(7.5) 



Xi=ai, Ki<N, 



— q K 



Remark. Clearly, we can replace T> r N with any product of such operators, similarly 



replacing e r (q Xl t N 1 , 
be valid. 



q N ) with the corresponding product, and the statement will still 



The remarkable property of Proposition 7.11 is that while the Macdonald polynomials 
themselves are fairly mysterious objects with complicated definition, the right-hand side 
of (7.5) is explicit. Therefore, we have the formulas for averages of certain observables 
of Macdonald measures. These formulas can be compactly written via contour integrals. 
Let us present one particular case of such formulas. 

Theorem 7.12. Assume that, in the notations of Proposition 7.11\ t = 0, all parameters 
dj are equal to 1, and p 2 is the Macdonald-positive specialization with single non-zero 
parameter 7 = r. In other words, we deal with probability measure 



e-" T P x (l,...,l;q,0)Q x ((0;0;T);q,0). 
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Then 



Eg 



■l) fc g 



fc(fc-i) 



(27ri) fc 



n 



za - zb 
za - qzB 



e (q-l)r Zj dz . 



Z j) N Z j 



A<B * " j=l 

where the integration is over the nested contours such that Zj -contour contains 1 and also 
QZj+u 



,qzk, and no other singularities of the integrand, see Figure 23 



Remark 1. The measure of Theorem 7.12 via the identification of Young diagrams 
with iV-rows and TV-point particle configurations coincides with the distribution of the 
vector Zy{r\ 
Remark 



r) of the stochastic dynamics of Section 7.2 



t can be non-zero, 



2. Theorem |7.12| admits various generalizations: 
both specializations can be arbitrary, we can compute expectations related to the 
higher order Macdonald operators and also apply it to the joint distributions of 
various Young diagrams of the Macdonald process. See |Borodin-Corwin-ll| . and 
[B or o din- Corwin- Gorin-Shakirov-T3] for details. 




Figure 23: Nested contours of integration. Here iV 
contour and z 3 is integrated over the smallest one. 



3, Z\ is integrated over the largest 



The moments of g Ajv can be combined into a g-Laplace transform for which we can 
also get a neat expression in the form of a Fredholm determinant. 



Theorem 7.13. In the notations of Theorem 7.12, for all £ e C\ 1R + we have 

1 



E 



(Cg AiV ;g)c 



det(I + K ( ) L 2 (Cuj) , 



where C u is a positively oriented small circle around 1, and the operator is an integral 
operator defined in terms of its kernel 



K((w, w') 



2vri 



ioo+1/2 



ioo+1/2 



r(- s )r(i + s)(-c)V(9')ds 1 



where 



(w;g)oc 

The operator is trace- class for all ( G C \ R. 



q b w — w 



N 



exp I Tw(q s 
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The appearance of the Fredholm determinant in Theorem 7.13 is unexpected. As 
we already mentioned, there is no known structure of determinantal point process 
for the Macdonald measure. The computation which leads to this determinant (see 



[Borodin- Corwin- 1 1| for the proof) turns out to be parallel to the "shift contour argu- 



ment" in the harmonic analysis on Riemannian symmetric spaces, which goes back to 
Helgason and Heckman-Opdam, see |Heckman-Opdam-97 and references therein. The 



main step of the proof is the modification of contours of integration in Theorem 7.12 and 
careful residue book-keeping. In the end all the residues miraculously combine into a 
single Fredholm determinant, which should be viewed as a manifestation of the fact that 
we are working in the representation-theoretic framework. 



Through the limit transition described in Section 7.2 we can now get some information 



about the partition function of the semi-discrete O'Connell-Yor polymer. Namely, the 



limit of Theorem 7.12 gives moments of this partition function and the limit of Theorem 
7.13 gives the expectation of its Laplace transform, see |Borodin-Corwin-llj for an exact 
statement. 

Interestingly enough, the argument relating the Laplace transform to the moments no 
longer works after the limit transition. The desired relation between the moments and 
the Laplace transform of a random variable £ is 

H u -w =S^- (7 - 6) 

fc=0 / fc=0 

However, the moments of the partition function of the O'Connell-Yor directed poly- 
mer grow rapidly (as e k , cf. [Bo rodin-Corwin-12j ) and the series in the right side of 



(7.6) does not converge for any ti / 0. This is caused by the intermittency which we 
discussed in Section [TJ Similar things happen when one considers fully continuous poly- 
mer which we briefly mentioned in Section [TJ i.e. when one integrated 2d white noise 
over the paths of Brownian bridges. Nevertheless, physicists tried to overcome these 
difficulties and find the Laplace transform (and the distribution itself after that) using 
the moments (the latter could be computed in this model using the so-called Bethe 
ansatz for the delta quantum Bose-gas). A non-rigorous argument used here is known 
as replica trick and it has a long history (first applied to directed polymers in random 
media in |Kardar-87] ) ; this is some sort of an analytic continuation argument for the 
function with specified values at integer points. However, the first answers to this prob- 
lem obtained via the replica trick were incorrect. (They were later corrected though, cf. 
|Dotsnenko-10j . |Calabrese-Doussal-Rosso-10] .) A correct mathematical computation of 
the Laplace transform for the continuous directed polymer appeared concurrently in the 
work of Amir-Corwin-Quastel | Amir- Corwin- Quast el- 1 l] (which is fully rigorous) and, 
independently, of Sasamoto-Spohn |Sasamoto-Spohn-10| ; it was based on previous work 
of Tracy-Widom on ASEP, see |Tracy-Widom-l lj and references therein. 



Therefore, the manipulations with contours which deduce Theorem 7.13 from Theorem 



7.12 can be viewed as a fully rigorous g-incarnation (put it otherwise, as a mathematical 



justification) of the non-rigorous replica trick. See |Borodin-Corwin-Sasamoto-12] for 
further developments in this direction. 

The asymptotic analysis of the operator K in the Fredholm determinant of Theorem 
7.13 based on the steepest descent ideas that we discussed in Section [5j leads to the 
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Tracy-Widom distribution F 2 (s) (which is also given by Fredholm determinant, as we 
remarked after Theorem 5.6). Along these lines one proves the KPZ-universality for the 
partition function of the O'Connell-Yor directed polymer, and also of certain integrable 
discrete polymers (Theorem 1.5), see |Borodin-Corwin-l lj . |Borodin-Corwin-Ferrari-T2] . 
|Borodin-Corwin-Remenik7[ 2j . [Bo rodin-Corwin-Sasamoto-12] . 
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